|
|
|
|
|
by hangsi
650 days ago
|
|
The common method for choosing the next output token for an LLM is sampling from a Boltzmann distribution. If you have seen the term "temperature" in the context of language models, that is a direct link to the statistical gas mechanics. |
|