| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by deadbabe 568 days ago
	They are deterministic at 0 temperature

5 comments

lokhura 568 days ago

At zero temp there is still non-determism due to sampling and the fact that floating point addition is not commutative so you will get varying results due to parallelism.

link

BalinKing 568 days ago

(Disclaimer: I know literally nothing about LLMs.) Wouldn't there still be issues of sensitivity, though? Like, wouldn't you still have to ensure that the wording of your commands stays exactly the same every time? And with models that take less discrete data (e.g. ChatGPT's new "advanced voice model" that works on audio directly), this seems even harder.

link

BalinKing 568 days ago

s/advanced voice model/advanced voice mode/ (too late for me to edit my original comment)

link

wkat4242 568 days ago

They are pretty deterministic then but they are also pretty useless at 0 temperature.

link

ukuina 568 days ago

Not for the leading LLMs from OpenAI and Anthropic.

link

vrighter 567 days ago

Not really, not in practice. The order of execution is non-deterministic when running on a cluster or a gpu, or more than one core of the CPU and rounding errors propagate differently on each run.

link