by ACCount37
247 days ago
The answer is that it does know. Not exactly, but the "general shape" of the answer is known to the LLM before the very first token of the answer is emitted!

"Next token prediction" is often overstated - "pick the next token" is the exposed tip of a very large computational process. And LLMs are very sharp at squeezing the context for every single bit of information available in it. They're much less sharp at using it in the ways you want them to.

There's enough information at "no token emitted yet" for an LLM to start steering the output towards "here's the answer" or "I don't know the answer" or "I need to look up more information to give the answer" immediately.

And if it fails to steer it right away? An LLM optimized for hallucination avoidance could still go "fuck consistency drive" and take a sharp pivot towards "no, I'm wrong" mid-sentence if it had to. That's what you'd see, for example, if you took control and forced a wrong answer by tampering with the tokens directly, then handed control back to the LLM.
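To make the "tamper with the tokens, then hand control back" experiment concrete, here's a minimal sketch of the control flow in Python. Everything in it is an assumption for illustration: `generate`, `toy_model` and the token strings are hypothetical names, and `toy_model` is a hard-coded rule table, not an LLM. Its pivot to "no," is written in by hand, so this shows only how the forcing works, not evidence that a real model would recover.

```python
from typing import Callable, List

Token = str
NextTokenFn = Callable[[List[Token]], Token]

EOS = "<eos>"


def generate(
    next_token: NextTokenFn,
    prompt: List[Token],
    forced: List[Token],
    max_new_tokens: int = 8,
) -> List[Token]:
    """Emit `forced` tokens verbatim, then let the model continue."""
    context = list(prompt)
    output: List[Token] = []

    # Phase 1: tampering. These tokens are appended without consulting
    # the model at all, as if it had "said" them itself.
    for tok in forced:
        context.append(tok)
        output.append(tok)

    # Phase 2: control goes back to the model, which now sees the
    # forced tokens as part of its own answer so far.
    while len(output) < max_new_tokens:
        tok = next_token(context)
        if tok == EOS:
            break
        context.append(tok)
        output.append(tok)

    return output


def toy_model(context: List[Token]) -> Token:
    """Hypothetical stand-in for an LLM (hand-written rules only)."""
    answer_so_far = context[context.index("?") + 1:]
    if not answer_so_far:
        return "Paris"   # nothing emitted yet: go straight to the answer
    if answer_so_far[-1] == "Lyon":
        return "no,"     # wrong answer in context: pivot mid-sentence
    if answer_so_far[-1] == "no,":
        return "Paris"   # finish the correction
    return EOS


PROMPT = ["capital", "of", "France", "?"]

untampered = generate(toy_model, PROMPT, forced=[])
tampered = generate(toy_model, PROMPT, forced=["Lyon"])
print(untampered)
print(tampered)
```

With a real model you'd do the same thing by appending the forced tokens to the context before sampling resumes. Whether the continuation pivots to a correction or doubles down on the forced answer is exactly the empirical question here.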
Can you help correct where I'm going wrong?