Hacker News
by astrange 114 days ago
LLMs also don't work by generating probability distributions of the next word. Your explanation can't account for why they can generate words, let alone sentences.

That is exactly how they work.
No, a token is not a word.
I mean, it is some text.
How do you get from a piece of text smaller than a word to an entire coherent sentence?
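
One way to picture the answer is a loop: predict a distribution over the next token, pick one, append it, repeat. The sketch below is a toy, not a real LLM. The `NEXT_TOKEN_PROBS` table, its probabilities, and the `generate` function are all made up for illustration, and the table only looks at the previous token, whereas an actual model conditions on the whole context and computes the probabilities with a neural network. It does show how sub-word pieces like "izer" and "s" get chained into a sentence.

```python
import random

# Hypothetical toy "model": maps the previous token to a probability
# distribution over the next token. Tokens are pieces of text, some
# smaller than a word ("izer", "s"). All numbers here are invented.
NEXT_TOKEN_PROBS = {
    "<s>": {"The": 0.9, "A": 0.1},
    "The": {" token": 0.8, " cat": 0.2},
    "A": {" token": 1.0},
    " token": {"izer": 0.7, " is": 0.3},
    "izer": {" split": 0.6, " is": 0.4},
    " split": {"s": 1.0},
    "s": {" words": 1.0},
    " words": {".": 1.0},
    " cat": {" is": 1.0},
    " is": {" small": 1.0},
    " small": {".": 1.0},
    ".": {"</s>": 1.0},
}


def generate(greedy=True, seed=0, max_tokens=20):
    """Build text one token at a time until the end marker appears."""
    rng = random.Random(seed)
    token, pieces = "<s>", []
    for _ in range(max_tokens):
        dist = NEXT_TOKEN_PROBS[token]
        if greedy:
            # Take the most probable next token.
            token = max(dist, key=dist.get)
        else:
            # Sample the next token in proportion to its probability.
            token = rng.choices(list(dist), weights=list(dist.values()))[0]
        if token == "</s>":
            break
        pieces.append(token)
    # Concatenating the token strings yields the final text.
    return "".join(pieces)


print(generate())
```

With `greedy=True` the loop walks "The", " token", "izer", " split", "s", " words", "." and joins them into one sentence. Passing `greedy=False` samples from each distribution instead, so different seeds can give different sentences from the same table.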