by basch 109 days ago
Which you essentially cannot do

It is inherently randomized

1 comment

But isn't the link shared by that comment doing exactly that?

https://sulbhajain.medium.com/why-llms-arent-truly-determini...

>The Thinking Machines research team showed it’s possible to fix this. They built batch-invariant kernels for RMSNorm, matrix multiplication, and attention, integrating them into the open-source inference engine vLLM.

>The outcome: 1,000 identical prompts, 1,000 identical outputs. Perfect reproducibility.

??
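
For anyone wondering why batch invariance would matter at all: my understanding is that floating-point addition is not associative, so a kernel whose reduction order depends on batch size can give slightly different numbers for the same prompt. Here is a toy Python sketch of that effect. It is not the Thinking Machines or vLLM code; `chunked_sum` and the sample values are made up for illustration, with `chunk_size` standing in for whatever makes a real kernel split its reduction differently.

```python
def chunked_sum(values, chunk_size):
    """Sum floats by reducing fixed-size chunks first, then the partial sums.

    Hypothetical helper: chunk_size plays the role of a batch-dependent
    tiling choice that changes the order of additions.
    """
    partials = []
    for start in range(0, len(values), chunk_size):
        acc = 0.0
        for x in values[start:start + chunk_size]:
            acc += x  # plain left-to-right accumulation inside a chunk
        partials.append(acc)

    total = 0.0
    for p in partials:
        total += p  # combine partial sums left to right
    return total


# Same numbers, two reduction orders.
values = [1.0, 1e100, -1e100, 1.0]

one_chunk = chunked_sum(values, 4)   # ((1.0 + 1e100) - 1e100) + 1.0
two_chunks = chunked_sum(values, 2)  # (1.0 + 1e100) + (-1e100 + 1.0)

# Holding the reduction order fixed gives the same bits on every run.
repeated = {chunked_sum(values, 2) for _ in range(1000)}

if __name__ == "__main__":
    print(one_chunk, two_chunks, len(repeated))
```

With one chunk the small `1.0` at the end survives; with two chunks both small terms are absorbed by `1e100` before the large values cancel. Keeping the order fixed regardless of batch size is, as I read the quote, what the batch-invariant kernels are for.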