Hacker News new | ask | show | jobs
by jampekka 519 days ago
Decoding (sampling) uses (pseudo) random numbers. Otherwise same prompt would always give the same response.

Computing entropy generally does not.

See e.g. https://huggingface.co/blog/how-to-generate

1 comments

Sure - but that's not the output of the model itself, that's the process of (typically) randomly sampling from the output of the model.
Right, sampling from a model, also known as *inference* (for LLM's).

The inference here is perhaps less pure than what you refer to but you're talking to human beings; there's no need for heavy pedantry.