|
|
|
|
|
by deepGem
475 days ago
|
|
Any idea what the sRAM to uRAM ratio is on these new GPUs ? If they have meaningfully higher sRAM than the Hopper GPUs, it could lead to meaningful speedups in large model training. If they didn't increase the memory bandwidth, then 512GB will enable longer context lengths and that's about it right? No speedups For any speedups You may need some new variant of FlashAttention3 or something along similar lines to be purpose built for Apple GPUs. |
|