| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by dcreater 292 days ago
	Thank you! Why are the comparisons to llama3.1 era models?

1 comments

lllllm 292 days ago

we compared to GPT-OSS-20B, Llama 4, Qwen 3, among many others. Which models do you think are missing, among open weights and fully-open models?

Note that we have a specific focus on multilinguality (over 1000 languages supported), not only on english

link

kamranjon 292 days ago

How did it compare with Gemma 3 models? I’ve been impressed with Gemma 27b - but I try out local models frequently and I’m excited to boot up your 70b model on my 128gb MacBook Pro when I get home!

link

dcreater 292 days ago

ah im sorry, I missed that - im not that blind usually..

link