Y
Hacker News
new
|
ask
|
show
|
jobs
by
DiabloD3
278 days ago
Same calculation, basically. Any given ~30B model is going to use the same VRAM (assuming loading it all into VRAM, which MoEs do not need to do), is going to be the same size