by dudeinjapan
973 days ago
What is missing from this picture is idleness. For example, suppose I have job A with a 10-second SLO and job B with a 5-minute SLO. If Bs only arrive sporadically, I may want to define queue X=A only and queue Y=A,B, so that idle compute on Y processes more As. In the wild, this is a delicate balancing act.
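A rough sketch of that setup, assuming each worker pool lists the job types it may take in priority order. The pool names X and Y come from the example above; `WORKER_POOLS`, `next_job`, and the choice to have Y check B before A are assumptions for illustration, not any particular queue library's API.

```python
from collections import deque

# Hypothetical worker pools: job types each pool may take, in priority order.
WORKER_POOLS = {
    "X": ["A"],       # dedicated to the 10-second-SLO job
    "Y": ["B", "A"],  # assumed: serve B first, fall back to A when no B is waiting
}


def next_job(pool, pending):
    """Return the next job for a worker in `pool`, or None if nothing is eligible."""
    for job_type in WORKER_POOLS[pool]:
        if pending[job_type]:
            return pending[job_type].popleft()
    return None


# Three As and one B are waiting; one X worker and then a Y worker (twice) ask for work.
pending = {"A": deque(["a1", "a2", "a3"]), "B": deque(["b1"])}
picked = [next_job("X", pending), next_job("Y", pending), next_job("Y", pending)]
print(picked)  # ['a1', 'b1', 'a2']
```

Without preemption, a Y worker that has just picked up an A cannot serve a B until that A finishes, which is part of why this is a balancing act.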
You need either dedicated workers for low-latency tasks or some sort of preemption to meet SLOs with such heterogeneous tasks.