by DiabloD3 278 days ago
Same calculation, basically. Any given ~30B model is going to use the same VRAM (assuming you load it all into VRAM, which MoEs do not need to do), is going to be the same size
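
For anyone who wants the arithmetic spelled out, here is a rough sketch of that calculation. It assumes the weights dominate and counts only them (no KV cache, activations, or runtime overhead), and it does not model partial offloading of MoE experts. The function name `weight_memory_gib` and the three bit widths are my own illustrative choices, not anything from a specific runtime.

```python
def weight_memory_gib(params_billions: float, bits_per_weight: float) -> float:
    """Approximate memory needed for the weights alone, in GiB.

    Assumption: every parameter is stored at the same precision and the
    whole model is resident in VRAM. Dense or MoE does not matter here,
    only the total parameter count.
    """
    total_bytes = params_billions * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


# Hypothetical precisions, chosen only to show how the number scales.
for label, bits in [("16-bit", 16), ("8-bit", 8), ("4-bit", 4)]:
    print(f"30B @ {label}: ~{weight_memory_gib(30, bits):.1f} GiB")
```

The point is that the result depends only on parameter count and bits per weight, so two ~30B models at the same precision land on the same figure if fully loaded.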