Hacker News new | ask | show | jobs
by daemonologist 75 days ago
The implication is that there is (should be) a major speed difference - naively you'd expect the MoE to be 10x faster and cheaper, which can be pretty relevant on real world tasks.