Hacker News new | ask | show | jobs
by chpatrick 805 days ago
It's possible but you have to make sure that floating point operations always happen in the same order (for example you could operate on blocks concurrently then merge them serially). You also have to be careful with optimizations like FMA because they produce a different result than multiply then add.
1 comments

Are you sure this cross-platform determinism works for GPUs? I can't find any reference about that.