|
|
|
|
|
by skepticATX
341 days ago
|
|
OpenAI’s systems haven’t been pure language models since the o models though, right? Their RL approach may very well still generalize, but it’s not just a big pre-trained model that is one-shotting these problems. The key difference is that they claim to have not used any verifiers. |
|
If you mean pure as in there’s not additional training beyond the pretraining, I don’t think any model has been pure since gpt-3.5.