This goes back a long way, all the way to MMX: https://yro.slashdot.org/comments.pl?sid=155593&cid=13042922
tl;dr: Intel's compiler doesn't use the standardized instruction-set extensions to optimize code when it runs on non-Intel hardware, even if the CPU supports them.
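
To make the complaint concrete, here's a rough sketch of the difference between gating a fast path on the CPU vendor and gating it on the feature the CPU actually reports. This is not Intel's code. The `cpu_info` struct, the `code_path` enum and both function names are made up for illustration, and a real dispatcher would fill in the fields from CPUID rather than take them as input.

```c
#include <stdbool.h>
#include <string.h>

/* Hypothetical CPU description; a real dispatcher would read this via CPUID. */
struct cpu_info {
    const char *vendor;   /* vendor string, e.g. "GenuineIntel" or "AuthenticAMD" */
    bool has_sse2;        /* feature bit as reported by the CPU itself */
};

enum code_path { PATH_GENERIC, PATH_SSE2 };

/* Vendor-gated dispatch: the fast path is taken only when the vendor
 * string matches, regardless of what the CPU says it supports. */
enum code_path dispatch_by_vendor(const struct cpu_info *cpu) {
    if (strcmp(cpu->vendor, "GenuineIntel") == 0 && cpu->has_sse2)
        return PATH_SSE2;
    return PATH_GENERIC;
}

/* Feature-gated dispatch: any CPU that reports the feature gets the fast path. */
enum code_path dispatch_by_feature(const struct cpu_info *cpu) {
    return cpu->has_sse2 ? PATH_SSE2 : PATH_GENERIC;
}
```

With the first function, a non-Intel CPU that reports SSE2 still lands on the generic path. With the second, it gets the same path an Intel CPU would.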