14
Training GPUs
0:00
Time (min:sec)
2
Inference GPUs
Training
Inference
Idle
Steady state: 14 GPUs running training jobs, 2 GPUs serving inference.
0:00 Burst starts Peak inference Scale down Recovery