Hacker News new | ask | show | jobs
by anko 1495 days ago
cool stuff! I was trying to do something similar on Arm Neon (m1 pro) with the CNT instruction https://developer.arm.com/documentation/ddi0596/2020-12/SIMD...

I believe it does 128 bits per instruction, but I'm still struggling with rust w/ asm.

Along my journeys, however, I found this repo https://github.com/WojciechMula/sse-popcount/ which has tons of competing simd implementations for both intel and arm.