Hacker News new | ask | show | jobs
by krona 1159 days ago
There's nothing requiring non-determinism in these models, it's just there as a means to increase variety. There are obviously scenarios where this makes less sense.
1 comments

Kinda, there are GPU kernels that are sped-up by being non-deterministic, so you also gain performance with non-determinism