Hacker News new | ask | show | jobs
by camel-cdr 1149 days ago
For anybody interested in this, here is an article discussing a very similar problem using arm neon intrinsics, also using the interleaved loads: https://branchfree.org/2019/04/01/fitting-my-head-through-th...