Hacker News new | ask | show | jobs
by newswasboring 139 days ago
>there is a bit of non-determinism in batched non-associative math that can vary by batch / hardware

Maybe a dumb question but does this mean model quality may vary based on which hardware your request gets routed to?