Hacker News new | ask | show | jobs
by Filligree 628 days ago
The temperature parameter doesn’t introduce any noise into the network evaluation.

Typically, what happens is that the network outputs a set of possible tokens with different probabilities, and a sampler picks from the top possibilities. Temperature determines how spiky its pick is; at zero it’ll always pick the top option.