| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by patrickas 1172 days ago
	LLMs do have explicit world models that can be even manipulated. There are many recent papers on the subject.

1 comments

hammyhavoc 1172 days ago

ChatGPT itself tells me it has no explicit world model.

link

patrickas 1172 days ago

It is not about what the model tells you.

This paper shows an emergent world model in an LLM that was taught to play otello moves https://ar5iv.labs.arxiv.org/html/2210.13382

https://arxiv.org/pdf/2303.12712.pdf This paper discusses (among other things) how a GPT4 model navigated between rooms in a text adventure game and was able to create a map afterward. Literally building a model of the world as it was navigating and drawing a map of that afterwards

link

NumberWangMan 1172 days ago

I mean, just like you can create 1-line python script that claims "I am an AGI" and have that fact be false, you can have ChatGPT tell you it has no explicit world model, while exhibiting behaviors that can only really be explained by it having some sort of model of the world inside it.

Fine-tuning is like a PR agent teaching someone what sorts of things not to mention on TV even though they may be true.

link

hammyhavoc 1172 days ago

What are some of these behaviours?

link

NumberWangMan 1172 days ago

Patrickas responded with some examples: https://news.ycombinator.com/item?id=35966307

link