Hacker News new | ask | show | jobs
by bick_nyers 437 days ago
MoE inference wouldn't be terrible. That being said, there's not a good MoE model in the 70-160B range as far as I'm aware.