Hacker News new | ask | show | jobs
by lend000 394 days ago
Not a bad idea for next generation models, especially since the state of the art already uses Mixture of Experts.