Hacker News
by pico_creator 615 days ago
Exactly, it's covered in the article: there is a segmentation happening by GPU cluster size.

If it is big enough for foundation model training from scratch, it's ~$3+/hr. Otherwise the price drops hard.

Problem is, "big enough" is a moving goalpost now: what was big becomes small.

so why not buy up all the little H100s and put enough together for a cluster? seems like a decent rollup strategy?

of course it would still cost a lot to do... but if the difference is $2/hr vs $4.49/hr then there's some size where it makes sense
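
A rough back-of-envelope sketch of that break-even, in Python. Only the $2/hr and $4.49/hr rates come from this thread; the GPU count, the one-off rollup cost (buying, relocating, networking), and the utilization figure are purely hypothetical placeholders, and the function name is made up for illustration.

```python
def breakeven_hours(small_rate, cluster_rate, num_gpus, rollup_cost, utilization=1.0):
    """Rented hours needed before the per-GPU price uplift repays the rollup cost.

    small_rate / cluster_rate: $ per GPU-hour before and after consolidation.
    rollup_cost: assumed one-off cost of assembling the cluster (hypothetical).
    utilization: assumed fraction of GPU-hours actually sold (hypothetical).
    """
    uplift_per_hour = (cluster_rate - small_rate) * num_gpus * utilization
    if uplift_per_hour <= 0:
        raise ValueError("no price uplift, the rollup never pays back")
    return rollup_cost / uplift_per_hour


# Rates from the thread; 1024 GPUs and a $5M rollup cost are invented numbers.
hours = breakeven_hours(2.00, 4.49, num_gpus=1024, rollup_cost=5_000_000)
print(f"break-even after ~{hours / 24:.0f} days of full rental")
```

The point is only that the payback time shrinks as the cluster grows or the price gap widens; it ignores financing, depreciation, and the risk that "big enough" moves again.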

Only if they're networked with InfiniBand.
Makes sense, though only folks like runpod / sfcompute / etc. have enough visibility to maybe pull this off?

It's a riskier move than just taxing the excess compute now and printing money on the margins from bag holders.

Correct me if I'm wrong, but if I recall, neither of those two companies own their own compute. They are marketplaces.
Yup, but they at least know where all these "small unused clusters" are.

Bag holders do not want to be shouting to the world that they are bag holders.

I think sfcompute does own a lot or most of the current compute on their platform? Not entirely sure though.