by dahart
651 days ago
I learned relatively recently that trig functions on the GPU are effectively free if you don’t use too many of them; there’s a separate hardware pipe, so they can execute in parallel with float adds and muls. There’s still extra latency, but it’ll be hidden if there’s enough other work in the vicinity.
Do you know if that hardware pipeline works only for these intrinsic variants?