Hacker News new | ask | show | jobs
by mgaunard 1477 days ago
It's fairly obvious: the rewrite prevents parallelization because floating-point isn't associative.

You'd need to parallelize it explicitly (which can be done by just unrolling the loop).