Hacker News new | ask | show | jobs
by RaftPeople 53 days ago
> I've had a lot of thoughts and conversations

Do LLM's have thoughts?

When you composed your post, your thought already existed in you head and you chose words that expressed the thought you held in your head.

When LLM's choose words, they choose them on the fly and the end result could be concept X or it could be concept Y, it meandered to a destination.

1 comments

the latent space of the LLM when it chooses each token is 10s or even hundreds of GB for each word that it chooses. It's not really useful to look at LLMs from the perspective of its prediction head which is a very small part of the model.
Late response, was out of town.

Agreed there is significant information in the latent space, but what is missing is a fully resolved "thought" based on that information plus current context plus validation against an internal working model of the world.

Except that latent space does not change in response to new information, something that thoughts famously do. If you read a book that captures the author's thoughts, disagree, and write an eloquent arguments to the author, you might change the author's mind. But you will not change the "book's thoughts" on the subject.

Latent spaces are maps of thoughts other people have had, not the thoughts themselves.

This gets a bit tricky. Over very long task contexts (1M tokens) or with prompt compression (10s of millions of tokens) the model can alter its priors based on updated evidence. This form of knowledge based learning is not necessarily robust, but demonstrably does occur.
"the model can alter its priors"

The model doesn't have high-level priors in the Bayesian sense (though you could have priors about it).

The low-level priors it does have (the weights) are not modified by the context.