by verdverm
51 days ago
There was a qwen-3.6 MoE six days ago that I thought was better than Gemma 4. Today's is a dense model. (Gemma released both a 26B MoE and a 31B dense at the same time.) I intend to evaluate all four on some evals I have, as long as I don't get squirrelled again.