Genuine question, but why not set the temperature to 0? I do this for non-code related inference when I want the same response to a prompt each time.
[1] https://thinkingmachines.ai/blog/defeating-nondeterminism-in...
[1] https://thinkingmachines.ai/blog/defeating-nondeterminism-in...