Hacker News new | ask | show | jobs
by YetAnotherNick 812 days ago
That's the point of MoE. Sacrificing VRAM for compute/RAM bandwidth which makes it harder sell for consumer devices but easier for server devices where things are more likely to be compute or RAM bandwidth bound.