Hacker News
by lostmsu, 58 days ago
To be fair, the MoE from Qwen itself had the same "problem". The 3.5 122B MoE was the same as or worse than the 3.5 27B. I have yet to see a 122B 3.6.
UPD: NVM, Mistral Medium 3.5 is dense. So yes, it is worse in every way.