by dcreater 292 days ago
Thank you! Why are the comparisons to llama3.1 era models?

We compared against GPT-OSS-20B, Llama 4, and Qwen 3, among many others. Which open-weights or fully open models do you think are missing?

Note that we focus specifically on multilinguality (over 1000 languages supported), not only on English.

How did it compare with the Gemma 3 models? I've been impressed with Gemma 27B, but I try out local models frequently, and I'm excited to boot up your 70B model on my 128 GB MacBook Pro when I get home!
Ah, I'm sorry, I missed that. I'm not usually that blind.