|
|
|
|
|
by jbellis
58 days ago
|
|
For coding, qwen 3.6 35b a3b solved 11/98 of the Power Ranking tasks (best-of-two), compared to 10/98 for the same size qwen 3.5. So it's at best very slightly improved and not at all in the class of qwen 3.5 27b dense (26 solved) let alone opus (95/98 solved, for 4.6). |
|
https://blog.brokk.ai/introducing-the-brokk-power-ranking/