Hacker News
by marmaduke 564 days ago
My approach to this is to write a bunch of tiny “kernels” that are obviously vectorizable (SIMD-friendly) and then inline them all. The compiler does a pretty good job with that on both x86 and ARM.

https://github.com/maedoc/tvbk/blob/nb-again/src/util.h
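
To make the idea concrete, here is a minimal sketch of the pattern, not code taken from the linked header. The names `W`, `kmul`, `kscale`, `kadd` and `fused` are all made up for illustration, and the chunk width of 8 floats is an assumption (one AVX register, or two NEON registers). Each kernel is a fixed-trip-count loop over `restrict` pointers with no branches, which is the shape auto-vectorizers tend to handle well; the composite just chains them per chunk and relies on inlining to fuse everything.

```c
#include <stddef.h>

/* Hypothetical chunk width, chosen for illustration:
   8 floats = one 256-bit AVX register or two 128-bit NEON registers. */
enum { W = 8 };

/* Tiny kernels: fixed trip count, no branches, non-aliasing pointers. */
static inline void kmul(float *restrict out,
                        const float *restrict a,
                        const float *restrict b)
{
    for (int i = 0; i < W; i++) out[i] = a[i] * b[i];
}

static inline void kscale(float *restrict out, float s,
                          const float *restrict a)
{
    for (int i = 0; i < W; i++) out[i] = s * a[i];
}

static inline void kadd(float *restrict out,
                        const float *restrict a,
                        const float *restrict b)
{
    for (int i = 0; i < W; i++) out[i] = a[i] + b[i];
}

/* Composite built from the kernels: out = a*b + s*c.
   Assumes n is a multiple of W and that out does not alias the inputs. */
static void fused(float *restrict out,
                  const float *restrict a,
                  const float *restrict b,
                  const float *restrict c,
                  float s, size_t n)
{
    for (size_t i = 0; i < n; i += W) {
        float t0[W], t1[W];      /* small temporaries, expected to stay in registers */
        kmul(t0, a + i, b + i);
        kscale(t1, s, c + i);
        kadd(out + i, t0, t1);
    }
}
```

Whether this actually vectorizes depends on the compiler and flags (something like `-O3` with GCC or Clang is the usual starting point), so it is worth checking the generated assembly rather than assuming.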