Hacker News
by lend000 394 days ago
Not a bad idea for next-generation models, especially since the state of the art already uses Mixture of Experts.