Y
Hacker News
new
|
ask
|
show
|
jobs
by
mhitza
67 days ago
If you self-host an LLM you'll learn quickly that even batching, and caching can affect determinism. I've ran mostly self-hosted models with temp 0 and seen these deviations.