|
|
|
|
|
by TekMol
1090 days ago
|
|
Great, thanks for the clarification. And how does the NN represent the token at the output layer? Is it a binary representation of the token number? Or does it have a neuron for each token it knows and ChatGPT takes the most activated neuron as the answer? |
|
Tokens are part of words, approx 4 characters or 75% of word.
It gives a list of tokens with their probabilities on output.
It's a short list with highest probabilities.
Temperature controls which tokens to pick - usually 0% = top one only (consistent results), closer to 100% means more randomness (more "creativity").