|
|
|
|
|
by Habgdnv
810 days ago
|
|
Unless you screw something, a different next token does not mean wrong answer. Examples: (80% of the time) The answer to the expression 2 + 2 is 4 (15% of the time) The answer to the expression 2 + 2 is Four (5% of the time) The answer to the expression 2 + 2 is certainly (95% of the time) The answer to the expression 2 + 2 is certainly Four This is how you can asp ChatGPT the same question few times and it can give you different words each time, and still be correct. |
|
I think a more correct explanation would be that increasing temperature doesn't necessarily increase the probability of a truly incorrect answer proportionately to the temperature increase (because the same correct answer could be represented by many different sequences of tokens), but if the model assigns a non-zero value to any incorrect output after applying softmax (which it most likely does), increasing the temperature does increase the probability of that incorrect output being returned.