|
|
|
|
|
by int_19h
324 days ago
|
|
It's not even a 20b model. It's 20b MoE with 3.6b active params. But it does not actually compete with o3 performance. Not even close. As usual, the metrics are bullshit. You don't know how good the model actually is until you grill it yourself. |
|