Hacker News new | ask | show | jobs
by mmoskal 417 days ago
Also ~noone runs h100 at home, ie at batch size 1. What matters is throughput. With 37b active parameters and a massive deployment throughout (per gpu) should be similar to Gemma.