|
|
|
|
|
by dzaima
498 days ago
|
|
> there's an open bug ticket noting that it doesn't protect against contraction (https://github.com/llvm/llvm-project/issues/91674). Huh. ¯\_(ツ)_/¯ I didn't really mean the loop thing as much of a problem for the goal of reproducibility (easy enough to just not explicitly request a vector math library). aarch32 NEON does have an implicit FTZ, and, yeah, such are annoying; though gcc and clang don't use it without -ffast-math (https://godbolt.org/z/3b11dW559) I do agree that getting consistent results would definitely make sense as the default. |
|