|
|
|
|
|
by mike_hearn
559 days ago
|
|
Again, read the papers. They absolutely do know facts, and that can be seen in the activations. Your description is oversimplified. It's easy to get models to emit statistically improbable but correct sequences of words. They are not just looking at what words are near by each other, that doesn't lead to the kind of output LLMs are capable of. |
|
https://en.wikipedia.org/wiki/Mark_V._Shaney