by gptadmirer
1291 days ago
But how can a probability distribution over sequences of consecutive tokens create new things? Like, I saw the other day it created C code that generates Lisp code that generates Pascal code. Is this based entirely on a previous creation?
The code examples are perfect case studies: they don't actually work. They aren't just slightly wrong; they're completely nonsensical.
Another example is "is <number> prime?". It can't answer questions like that, so it makes something up that may or may not be accurate.
The model has no concept of what is true or false. It's essentially trying to predict the most likely token to come next.
It seems to know stuff because the knowledge comes from the dataset, hence techniques like zero-shot, few-shot, and prompt-based learning.
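
To make the "most likely next token" point concrete, here is a toy sketch in Python. It is a plain bigram counter, not how GPT-style models are actually built, and the corpus and the function names (train_bigram, most_likely_next, generate) are made up for illustration. It only shows that picking the likeliest continuation can produce a fluent but false statement.

```python
from collections import Counter, defaultdict


def train_bigram(tokens):
    """Count, for each token, which tokens follow it in the corpus."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts


def most_likely_next(counts, token):
    """Return the most frequent follower of `token`, or None if unseen."""
    followers = counts.get(token)
    if not followers:
        return None
    return followers.most_common(1)[0][0]


def generate(counts, start, length):
    """Greedily extend `start` by up to `length` tokens."""
    out = [start]
    for _ in range(length):
        nxt = most_likely_next(counts, out[-1])
        if nxt is None:
            break
        out.append(nxt)
    return out


# Hypothetical toy corpus: "is" is followed by "prime" far more often than "odd".
corpus = "7 is prime . 11 is prime . 13 is prime . 15 is odd .".split()
model = train_bigram(corpus)

# Prints "15 is prime ." because "prime" is the likeliest follower of "is".
print(" ".join(generate(model, "15", 3)))
```

Nothing in the sketch checks whether 15 is prime. It only follows the counts, which is the sense in which the output "may or may not be accurate".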