by Turskarama
163 days ago
Have you ever had a concept you wanted to express, known that there was a word for it, but struggled to remember what the word was?
For human thought and speech to work that way it must be fundamentally different to what an LLM does. The concept, the "thought", is separated from the word.
|
We force this residual stream to project to the logprobs of all tokens, just as a human in the act of speaking a sentence is forced to produce words. But could this residual stream represent thoughts which don't map to words?
It's plausible: we already have evidence that things like glitch-token representations trend towards the centroid of the high-dimensional latent space, and that logprobs for tokens that represent wildly-branching trajectories in output space (e.g. "but" vs "exactly" for specific questions) represent a kind of cautious uncertainty.
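
To make the "forced projection" concrete, here is a toy sketch in plain Python. Everything in it is an assumption for illustration: the tiny random `unembed` matrix, the sizes, the helper names, and the use of a zero vector as a stand-in for a state with no token preference. A real model would also apply a final layer norm and use learned weights. It only shows that every residual vector gets mapped to logprobs over the whole vocabulary, and that entropy gives one rough measure of how "undecided" that mapping is.

```python
import math
import random

def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]

def project_to_logprobs(residual, unembed):
    """Project a residual vector onto every token row, then normalize."""
    logits = [sum(w * h for w, h in zip(row, residual)) for row in unembed]
    return log_softmax(logits)

def entropy(logprobs):
    """Shannon entropy (in nats) of the distribution given by logprobs."""
    return -sum(math.exp(lp) * lp for lp in logprobs)

def unit(vec):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]

random.seed(0)
D_MODEL, VOCAB = 8, 5  # hypothetical toy sizes

# Hypothetical unembedding matrix: one unit-length row per token.
unembed = [unit([random.gauss(0, 1) for _ in range(D_MODEL)])
           for _ in range(VOCAB)]

# A residual vector pointing along token 2's row: the "word" is clear.
confident = [4.0 * w for w in unembed[2]]
# A zero vector: no direction favours any token (illustrative assumption).
undecided = [0.0] * D_MODEL

confident_lp = project_to_logprobs(confident, unembed)
undecided_lp = project_to_logprobs(undecided, unembed)

print("confident entropy:", entropy(confident_lp))
print("undecided entropy:", entropy(undecided_lp))
```

Both vectors are forced into a distribution over the same five tokens. The zero vector comes out uniform, which is the highest entropy possible, even though nothing in it corresponds to any particular word.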