|
|
|
|
|
by NitpickLawyer
276 days ago
|
|
> Can anyone explain why that makes sense? Can be anything from different arch, more data, RL, etc. It's probably RL. In recent months top tier labs seem to have "cracked" RL to a level not seen yet in open models, and by a large margin. |
|