Hacker News new | ask | show | jobs
by freakynit 58 days ago
I was hoping a lot from it... but this one, is not up to that mark. For example, here is it's comparion with 4.7x smaller model, qwen3.7-27b.

https://chatgpt.com/share/69f239e8-7414-83a8-8fdd-6308906e5f...

Tldr: qwen3.6-27b, a 4.7x smaller model, have similar performance.

2 comments

To be fair MoE from Qwen itself had the same "problem". 3.5 122B MoE was same or worse than 3.5 27B. Yet to see 122B 3.6.

UPD. NVM, Mistral Medium 3.5 is dense. So yes, it is worse in every way.

That's a chatgpt summary. Actual usage would a better test.
yep.. until then, this is good enough since the tests are standard, and the results are numeric and can be compared without any doubt.