I did set GOMAXPROCS to the number of logical cores on my machine, and I tested the Node.js cluster with the same number of workers.