Hacker News new | ask | show | jobs
by mhitza 67 days ago
If you self-host an LLM you'll learn quickly that even batching, and caching can affect determinism. I've ran mostly self-hosted models with temp 0 and seen these deviations.