Hacker News new | ask | show | jobs
by crawdog 1840 days ago
Doing back of the napkin math you could accomplish this with 770 cloud instances in Google Cloud all running 8 attached GPUs. Now running lustre in the cloud is a pita so you would have to find out how to take advantage of cloud native storage. Running Rapids should be possible using Dataproc. Anyone have the budget to kick off the largest cloud native HPC workload?
2 comments

The interconnect on a tightly-coupled machine like this means it performs substantially better than the naively-equivalent distributed compute resources.
google cloud does not have high performance networking. You want azure or maybe AWS if you wnt high performance networking.