by Gimpei
411 days ago
Given that o3 is trained on the contents of the Internet, and the answers to all these chess problems are almost certainly on the Internet in multiple places, in a sense it has been weakly trained on this content. The question for me becomes: is the LLM doing better on these problems because it’s improving in reasoning, or is it simply improving in information retrieval?

That's not to say "are you remembering or reasoning" means the same thing when applied to humans as it does when applied to LLMs.