by eldenring
467 days ago
> The context window can be compared to working memory in humans: it’s fast, efficient but gets rapidly overloaded. Humans manage this limitation by offloading previously learned information into other memory forms, whereas LLMs can only mimic this process superficially at best.

This is just silly. Humans forget things all the time! If I want to remember something I write it down.

> The nature of hallucination is very different between AR models and humans, as one has a world model and the other doesn’t.

I stopped reading at this point. There's not much signal here, just basic facts about LLMs and then leaps to very bold statements.

Here is an interesting experiment I use to help people understand next token prediction. Think of a simple math problem in your head, maybe 3 digit by 2 digit multiplication. Then speak out every single thought you have while solving it.
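To make the experiment concrete, here is a rough Python sketch of what "speaking out every thought" could look like for one such problem. The function name `spoken_steps` and the operands 123 and 45 are made up for illustration, and it assumes the schoolbook partial-products method, which is only one of many ways a person might actually work it out.

```python
def spoken_steps(a: int, b: int) -> list[str]:
    """Return each intermediate 'thought' of schoolbook multiplication, in order."""
    steps = []
    total = 0
    # Walk the digits of b from least to most significant.
    for place, digit_char in enumerate(reversed(str(b))):
        scaled_digit = int(digit_char) * 10 ** place
        partial = a * scaled_digit
        steps.append(f"{a} x {scaled_digit} = {partial}")
        total += partial
        steps.append(f"running total: {total}")
    steps.append(f"answer: {total}")
    return steps


# Hypothetical example problem: 3-digit by 2-digit.
for line in spoken_steps(123, 45):
    print(line)
```

Each printed line depends only on the lines before it, which is the point of the exercise: the answer is produced one small step at a time rather than all at once.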
I do it all in images and I think many other people do too.