| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by madisonmay 45 days ago
	LLMs are not inherently non-deterministic during inference. I don't believe non-determinism implies lack of abstraction. Abstraction is simply hiding detail to manage complexity.

1 comments

danpalmer 45 days ago

Non-determinism is configurable at the level of the mathematical model, but current production systems do not support deterministic evaluation of LLMs.

link

orbital-decay 45 days ago

They do, though. Providers don't because batching makes it cheaper. Among the providers, DeepSeek seems to support it for v4 (and have actually optimized their kernels for batching), and Gemini Flash is "almost deterministic".

link

danpalmer 45 days ago

I'm pretty sure that the determinism issue is at the floating point math level, or even the hardware level. Just disabling batching and reducing the temperature to 0 does not result in truly deterministic answers.

link

orbital-decay 45 days ago

FP math itself is deterministic on real hardware, if the order of operations stays the same. Output reproducibility is much less of a problem than it seems, see for example https://docs.vllm.ai/en/latest/usage/reproducibility/

link

nnevatie 45 days ago

The FP math is deterministic. However, the environments in which inference is run and specifically batching make current LLM services practically non-deterministic.

link