| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by nl 2 days ago

> Rented out GPUs are likely not a similar use profile as compute used for training LLMs. The latter is likely closer to the cryptocurrency GPUs that are running at full tilt 24/7.

This is untrue.

H100's are used for training (well were, but are now outdated because B100/B200s are much faster).

Most of the reason people rent H100s is for smaller training runs.

If you are doing inference you usually buy managed capacity at Baseten or something, and that is often priced differently (although it comes down to an extra margin on longer term H100 prices basically).

Inference utilization is often actually higher than training now because so much effort has been spent on optimizing that stack.