| HN Mirror

You're talking about simple compression and encoding mechanisms and by implication you're drawing an analogy to an LLM encoding/compressing the information..

And sure, it does, but the person you're replying to was trying to understand why it also seems to reason about the query to give an answer consistent with it, despite not being trained on that query or answer. Your answer seems to imply that its just another slick complex encoding.

But the emergent property of trillions of digital neurons predicting the next token is that in the process of being trained to do so, they can also learn to reason.

At some scale, it is efficient to encode cognition which is capable of mimicing the cognition which generated the input tokens.