|
|
|
|
|
by dannyw
1183 days ago
|
|
There are some studies showing that LLMs are capable of representing internal state and knowledge in its models, even when only trained on language tokens: https://thegradient.pub/othello/ > Back to the question we have at the beginning: do language models learn world models or just surface statistics? Our experiment provides evidence supporting that these language models are developing world models and relying on the world model to generate sequences. Let’s zoom back and see how we get there. The GP's comment suggests that ChatGPT-4* has not internalized this (effectively) for Chess. * Just like how ChatGPT-3.5 is not GPT-3.5 (text-davinci-003), ChatGPT-4 is probably not the only GPT-4 model that will be released. |
|
In a similar vein, it is almost possible to adjudicate Diplomacy orders looking only at the orders and never the map.
Given sufficient interest, complex enough board games tend to converge on the same basic notational principles.