The 4NPS gives the best performance, followed by 2NPS, followed by non-NUMA. This surprised me as well.