Hacker News new | ask | show | jobs
by Someone 164 days ago
> with accesses aligned to cache lines.

You want that, but can be tricky because the from and to regions may have different alignment.

Also, the XOR trick introduces data dependencies. That slows down pipelined CPUs.