by haricm 4678 days ago
The original OpenCV code already uses intrinsics in many places, but enabling them results in only a 10% improvement.

We decided to report the non-intrinsics version, because reporting the original OpenCV numbers with intrinsics as SSE-optimized would be unfair to Intel. Apparently that code is not very well optimized.

My own guess is that if we added intrinsics for Intel to our own C code, it would get a boost of around 2x. We could have written a blog post without reporting the optimized C numbers for Intel, but that would again have been unfair to Intel.