Hacker News new | ask | show | jobs
by ronyfadel 1014 days ago
It still does a much better job at translation than llama 2 70b even, at 6.7b params
1 comments

If it's MOE that may explain why it's faster and better...
MOE?