by yanslookup 1131 days ago
> you can deploy hundreds of GPUs simultaneously.

> with over 2600 of them available for deployment.

guessing you mean 2600 in total.

FWIW we ran a workload recently on AWS that required a few thousand g4 instances in a single AWS region. We ended up scavenging and using g3s as well due to capacity constraints.

3 comments

That's quite an impressive workload!

If you're using our on-demand service and intend to terminate the machines once your work is completed, we currently have 2600 available GPUs. However, if you have an ongoing need for these machines, we also have reserved instances with additional stock, which brings our total capacity to an estimated 7000 GPUs as of today.

But of course these numbers could easily change in the future.

I think they mean that almost 2600 are literally available for you to rent right now. If you requested them all, you would get them immediately, and for a while there would be 0 available, because you would have them all.

Out of interest, was this workload for training or serving?