by savant_penguin
774 days ago
1. Incorrect. The output of a decoder LLM is a probability distribution over the next token given the input text. Temperature=0 means that the output distribution is not pushed closer to a uniform distribution. The randomness comes from sampling the next token from that distribution to generate text. If you want determinism, always take the argmax of the distribution.
Temperature = 1 is where it's not pushed in either direction.
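
To make the point above concrete, here is a minimal sketch of how temperature is commonly applied: the logits are divided by the temperature before the softmax. The function names (`softmax_with_temperature`, `greedy_token`, `sample_token`) are made up for illustration and are not any particular library's API. Since dividing by zero is undefined, this sketch treats greedy decoding as a separate argmax step; real implementations may special-case temperature 0 differently.

```python
import math
import random


def softmax_with_temperature(logits, temperature):
    """Turn logits into probabilities after dividing by the temperature.

    temperature == 1 leaves the distribution unchanged, values above 1
    flatten it toward uniform, values below 1 sharpen it toward the argmax.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive; use greedy_token for argmax")
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def greedy_token(logits):
    """Deterministic decoding: always pick the index of the largest logit."""
    return max(range(len(logits)), key=lambda i: logits[i])


def sample_token(logits, temperature, rng):
    """Stochastic decoding: draw one index from the tempered distribution."""
    probs = softmax_with_temperature(logits, temperature)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


if __name__ == "__main__":
    logits = [1.0, 3.0, 2.0]  # toy values, not from a real model
    rng = random.Random(0)
    for t in (0.5, 1.0, 2.0):
        print(t, [round(p, 3) for p in softmax_with_temperature(logits, t)])
    print("greedy:", greedy_token(logits))
    print("sampled:", [sample_token(logits, 1.0, rng) for _ in range(10)])
```

The greedy path never touches the random number generator, which is why it is repeatable, while the sampled path varies from run to run unless the seed is fixed.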