Hacker News new | ask | show | jobs
by m00x 10 days ago
It could be a much bigger MoE model
1 comments

Then it would be slower.