by tsurba
388 days ago
1.5-2 years ago I did some training for an ML paper on 4 AMD MI250X cards on LUMI (each is essentially 2 GPUs, so really 8 in total, each with 64GB VRAM). My JAX models and the baseline PyTorch models were quite easy to set up there, and in practice there was no noticeable perf difference compared to 8x A100s (which I used for prototyping on our university cluster). Of course it’s just a random anecdote, but I don’t think Nvidia is actually that much ahead.