Hacker News new | ask | show | jobs
by smallpipe 1198 days ago
The example is literally one cache line, which probably won't affect the rest of the program too much. But given the average L1 throughput, I'd bet the bitwise version is faster