Y
Hacker News
new
|
ask
|
show
|
jobs
by
lifthrasiir
1540 days ago
It does use Horner's rule, but splits the expression into two halves in order to exploit instruction-level parallelism.
1 comments
jacobolus
1540 days ago
Considering the form of both halves is the same, are compilers smart enough to vectorize this code?
link
adgjlsfhk1
1540 days ago
I might be wrong but I would think for something like this vectorizing wouldn't save time (since you would have to move data around before and afterwards. The real benefit of this is it lets you run two fma operations in parallel.
link