by cperciva 6582 days ago
it's merely operating on a chunk of 4 bytes at a time

If you have a modern CPU, that code operates on 8 bytes at a time. :-)
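
To make the point concrete: the code under discussion isn't quoted in this thread, so the following is only a rough sketch of the general word-at-a-time idea, not that code. It assumes the chunk type is `unsigned long`, which is 4 bytes on typical 32-bit platforms and 8 bytes on 64-bit LP64 platforms (Linux, the BSDs, macOS). The names `chunk_size` and `buffers_equal` are made up for illustration.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper: bytes handled per loop iteration.
 * 4 on typical 32-bit targets, 8 on 64-bit LP64 targets. */
static size_t chunk_size(void)
{
    return sizeof(unsigned long);
}

/* Hypothetical example: returns 1 if the first n bytes of a and b
 * are equal, 0 otherwise, comparing one machine word at a time. */
static int buffers_equal(const void *a, const void *b, size_t n)
{
    const unsigned char *pa = a;
    const unsigned char *pb = b;
    unsigned long wa, wb;

    assert(a != NULL && b != NULL);

    /* Main loop: one word per iteration. memcpy avoids alignment and
     * aliasing problems; compilers usually turn it into a plain load. */
    while (n >= sizeof(unsigned long)) {
        memcpy(&wa, pa, sizeof wa);
        memcpy(&wb, pb, sizeof wb);
        if (wa != wb)
            return 0;
        pa += sizeof wa;
        pb += sizeof wb;
        n -= sizeof wa;
    }

    /* Tail: leftover bytes that do not fill a whole word. */
    while (n-- > 0) {
        if (*pa++ != *pb++)
            return 0;
    }
    return 1;
}
```

The same source handles 4 bytes per iteration when built for a 32-bit target and 8 when built for a 64-bit LP64 target, with no change to the code itself.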