by samspenc
699 days ago
Fascinating. Despite the significantly better specs (and VRAM) on the AMD MI300X, the Nvidia H100 seems to match its performance at lower batch sizes and only loses out slightly at larger batches. I'm guessing the differentiator is mostly VRAM (192 GB on the MI300X vs. 80 GB on the H100). Does anyone know if this is just due to the ROCm vs. CUDA implementations, or something else?