by smaddox
982 days ago
> And that's because, despite all of its training data, it's not capable of actually reasoning.

Your conclusion doesn't follow from your premise. None of these models are trained to do their best on any kind of test. They're just trained to predict the next word. The fact that they do well at all on tests they haven't seen is miraculous, and demonstrates something very akin to reasoning. Imagine how they might do if you actually trained them or something like them to do well on tests, using something like RL.
|
How do you know GPT-4 wasn't trained to do well on these tests? OpenAI didn't disclose how it was trained, so you can't say it wasn't. That could be the magic sauce for it.