HN Mirror



	by HarHarVeryFunny 524 days ago
	I think you're confusing training and inference. During training there are things like initialization, data shuffling and dropout that depend on random numbers. At inference time these don't apply.


Decoding (sampling) uses (pseudo) random numbers. Otherwise the same prompt would always give the same response.

Computing entropy generally does not.
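
To make that concrete, here is a minimal sketch (not from the thread) of computing the Shannon entropy of a next-token distribution. The logits are made-up values and `softmax` and `entropy` are hypothetical helper names; the point is only that nothing here draws a random number, so repeated calls give the same result.

```python
import math

def softmax(logits):
    # Subtract the max for numerical stability before exponentiating.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def entropy(probs):
    # Shannon entropy in nats; zero-probability terms contribute nothing.
    return -sum(p * math.log(p) for p in probs if p > 0)

# Made-up logits standing in for a model's output over a 3-token vocabulary.
logits = [2.0, 1.0, 0.1]
probs = softmax(logits)

h_first = entropy(probs)
h_second = entropy(softmax(logits))  # recomputed from scratch, no RNG involved
```

Both values are identical because the computation is a pure function of the logits.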

Sure - but that's not the output of the model itself, that's the process of (typically) randomly sampling from the output of the model.
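
A rough illustration of that distinction (a sketch with made-up logits, not any particular library's API): the distribution is a deterministic function of the logits, and randomness only enters in the decoding step that picks a token from it. `greedy_decode` and `sample_decode` are hypothetical names.

```python
import math
import random

def softmax(logits):
    # Deterministic: the same logits always yield the same distribution.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def greedy_decode(probs):
    # No randomness: always pick the most probable token index.
    return max(range(len(probs)), key=lambda i: probs[i])

def sample_decode(probs, rng):
    # Randomness lives here, in the sampler, not in the distribution.
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]

# Made-up logits standing in for the model's output at one step.
logits = [2.0, 1.0, 0.1]
probs = softmax(logits)

greedy_token = greedy_decode(probs)

# Two samplers with the same seed reproduce the same draws.
rng_a = random.Random(0)
rng_b = random.Random(0)
draws_a = [sample_decode(probs, rng_a) for _ in range(200)]
draws_b = [sample_decode(probs, rng_b) for _ in range(200)]
```

With greedy decoding the same prompt would always give the same token, while sampled decoding varies unless the seed is fixed.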

Right, sampling from a model, also known as *inference* (for LLMs).

The inference here is perhaps less pure than what you're referring to, but you're talking to human beings; there's no need for heavy pedantry.