| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ptidhomme 197 days ago
	Those billion parameters, they are a model of the world. Autocomplete is such a shortsighted understanding of LLMs.

2 comments

Peteragain 191 days ago

Sorry for the late response. Yes that is Hinton's argument, and the claim made by the believers. On the other hand, if the GAC explanation is correct, an explanation might be that what we humans write down (that is, the training corpus) is a model of the world, and LLMs reconstruct (descriptions of) human understanding.

link

ptidhomme 186 days ago

Now of course, the only input LLMs have is human text (for text only LLMs anyway). So their model is entirely dependent on how we see the world. I wouldn't restrict LLMs to description of human understanding. They can articulate concepts in a rather sensible way, that wouldn't exist as is in the training corpus. Which exactly means that they have a model, however limited or imperfect.

link

Peteragain 185 days ago

"they can articulate concepts.. that [don't exist] in the training corpus" yes, but that doesn't necessarily mean they have a model [of the world]. You might want to say they are articulating the plausible (that is something that fits with our model of the world) but I think they are producing plausible articulations that we interpret against our model.

link

patrickmay 196 days ago

They're a model of language, not of the world.

link

ptidhomme 196 days ago

A model of language is a model of the world, else it being pure gibberish.

link

marcosdumay 196 days ago

A model of language is a model of a tiny specialized part of the world: language.

And if anybody gets annoyed that my comment is tautological, get annoyed by the people that made the comment necessary.

link

ptidhomme 196 days ago

When you ask an LLM a question about cars, it needs an inner representation of what a car is (how imperfect it may be) to answer your question. A model of "language" as you want to define it would output a grammatically correct wall of text that goes nowhere.

link

marcosdumay 196 days ago

A map of how concepts relate in language is not a model of the world, except on the extremely limited sense that languages are part or the world.

And yeah, that wasn't clear before people created those machines that can speak but can't think. But it should be completely obvious to anybody that interacts with them for a small while.

link

ptidhomme 196 days ago

"How concepts relate" is called a model. That it uses language to be interacted with is irrelevant to the fact that it's a model of of a worldly concept.

What of multi modal models according to you ? Are they "models of eyesight", "models of sound", or pixels or wavelengths... C'mon.

link