In-place merge sort exists. It's hard to write, though.
https://sites.google.com/site/algoxy/home/elementary-algorit...
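To give a feel for the idea, here is a minimal sketch of a merge sort that uses no scratch buffer. It is not taken from either link: it is the simple rotation-based variant, which I believe costs roughly O(n log² n) rather than the O(n log n) of the truly hard versions, and it still uses the recursion stack. The names `merge_in_place` and `merge_sort_in_place` are made up for illustration.

```rust
/// Merge the sorted runs v[..mid] and v[mid..] without a scratch buffer.
/// Sketch only: rotation-based, so it trades extra element moves for zero extra memory.
fn merge_in_place<T: Ord>(v: &mut [T], mid: usize) {
    let len = v.len();
    if mid == 0 || mid == len {
        return; // one run is empty, nothing to merge
    }
    if len == 2 {
        // Two single-element runs: a swap is the whole merge.
        if v[1] < v[0] {
            v.swap(0, 1);
        }
        return;
    }
    // Split the longer run at its midpoint and binary-search the matching
    // split point in the other run.
    let (i, j) = if mid >= len - mid {
        let i = mid / 2;
        // First element of the right run that is >= v[i].
        let j = mid + v[mid..].partition_point(|x| x < &v[i]);
        (i, j)
    } else {
        let j = mid + (len - mid) / 2;
        // First element of the left run that is > v[j].
        let i = v[..mid].partition_point(|x| x <= &v[j]);
        (i, j)
    };
    // Swap the two middle blocks v[i..mid] and v[mid..j] by rotating.
    v[i..j].rotate_left(mid - i);
    let k = i + (j - mid);
    // Everything in v[..k] is now <= everything in v[k..]; merge each side.
    merge_in_place(&mut v[..k], i);
    merge_in_place(&mut v[k..], mid - i);
}

/// Top-down merge sort built on the buffer-free merge above.
fn merge_sort_in_place<T: Ord>(v: &mut [T]) {
    let len = v.len();
    if len < 2 {
        return;
    }
    let mid = len / 2;
    {
        let (left, right) = v.split_at_mut(mid);
        merge_sort_in_place(left);
        merge_sort_in_place(right);
    }
    merge_in_place(v, mid);
}
```

Call it as `merge_sort_in_place(&mut data)` on any `Vec` or slice of `Ord` values. The hard part the comment refers to is getting the merge down to linear time without a buffer; this sketch sidesteps that by paying for rotations instead.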
https://github.com/nikomatsakis/rayon/pull/379
So, for a dual-core with HT: 8.26s vs 4.55s, roughly a 1.8x difference.