If performance matters, you should experiment with __builtin_prefetch, which is available in clang and GCC.