Y
Hacker News
new
|
ask
|
show
|
jobs
by
bick_nyers
437 days ago
MoE inference wouldn't be terrible. That being said, there's not a good MoE model in the 70-160B range as far as I'm aware.