Hacker News
by janwas 1480 days ago
Oh, thanks for pointing that out. Golly, Sandy Bridge is a bit old, yes - but still the result is surprising.

djb reports 8000 cycles for int32 x 256 - this is much slower than what we measure in bench_sort.cc, even for AVX2 (which he confirms is being reached). Not sure what's going on.