Y
Hacker News
new
|
ask
|
show
|
jobs
by
ufo
1544 days ago
I think the polynomial calculation in the end looks interesting. It doesn't use Horner's rule.
1 comments
lifthrasiir
1544 days ago
It does use Horner's rule, but splits the expression into two halves in order to exploit instruction-level parallelism.
link
jacobolus
1544 days ago
Considering the form of both halves is the same, are compilers smart enough to vectorize this code?
link
adgjlsfhk1
1544 days ago
I might be wrong but I would think for something like this vectorizing wouldn't save time (since you would have to move data around before and afterwards. The real benefit of this is it lets you run two fma operations in parallel.
link