|
|
|
|
|
by semi-extrinsic
4110 days ago
|
|
What do you mean, real-time GPUs? And: what interconnect are you running on? How does your scaling look? Is this just for embarrasingly parallel stuff?
Just curious; I'm running multi-GPUs myself for molecular dynamics. |
|
We started on building fast-start multitenant access to single GPUs and approaching peak on those (full-GPU barnes hut, 10X over Keshav's work). We're now focusing on distributing, and as we are more interested on running on many GPUs for scale out, focusing on communication avoiding. This makes a path to giving companies time on 1000 GPUs (think Pixar-levels of compute) rather than shipping small 8 GPU boxes with infiniband. Via elasticity and time sharing, the analyst hour pricing is unprecedented.
The titan guys run on 20,000 GPUs for similar astronomy codes, so doable. We're making it in more accessible, big-team, and analyst-focused ways. E.g., load, interactively analyze with smart defaults & streamlined common paths, export/report, and share.