by marcan_42 1875 days ago
Probably because they have an 8-wide decoder and a massive reorder buffer, so they can actually keep the backend fed.

The problem with x86 is that decoding is hell and takes increasingly large transistor counts to parallelize, so you end up with a bottleneck there. AArch64 doesn't have that problem: every instruction is a fixed 4 bytes.

1 comment

This is the single largest driver of M1 performance.

Variable-length, overlapping instructions have made wide x86 instruction decoding intractable. The obvious answer is to make it tractable; the non-obvious part is how to do that while remaining backward compatible.
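
To make the decode argument concrete, here's a rough Python sketch. It is a toy model, not real x86 or ARM: `toy_length` is a made-up encoding where the low two bits of the first byte give the instruction length, and all function names are hypothetical. It only illustrates why fixed-width boundaries are free to find, why variable-length boundaries form a serial chain, and why the usual workaround (guess a length at every byte offset, then throw most of the guesses away) costs extra hardware.

```python
from __future__ import annotations


def fixed_width_starts(code: bytes, width: int = 4) -> list[int]:
    """Fixed-width ISA: every start offset is known without looking at the bytes,
    so N decoders can each grab their own slot independently."""
    return list(range(0, len(code), width))


def toy_length(code: bytes, offset: int) -> int:
    """Hypothetical encoding (an assumption for this sketch):
    low 2 bits of the first byte encode a length of 1..4 bytes."""
    return (code[offset] & 0b11) + 1


def variable_length_starts(code: bytes) -> list[int]:
    """Variable-length ISA: the start of instruction k+1 depends on the
    length of instruction k, so boundaries are found one after another."""
    starts = []
    offset = 0
    while offset < len(code):
        starts.append(offset)
        offset += toy_length(code, offset)
    return starts


def speculative_starts(code: bytes) -> tuple[list[int], int]:
    """Parallel workaround: compute a length at EVERY byte offset (each of
    these is independent, so hardware can do them side by side), then follow
    the chain to keep only the real starts. Returns the real starts and the
    number of length computations that were thrown away."""
    lengths = [toy_length(code, i) for i in range(len(code))]
    starts = []
    offset = 0
    while offset < len(code):
        starts.append(offset)
        offset += lengths[offset]
    return starts, len(code) - len(starts)


if __name__ == "__main__":
    stream = bytes([0x01, 0xAA, 0x00, 0x03, 0x11, 0x22, 0x33, 0x02, 0x44, 0x55])
    print(fixed_width_starts(stream))
    print(variable_length_starts(stream))
    print(speculative_starts(stream))
```

In the speculative version the wasted work grows with the number of bytes you look at per cycle, which is the "increasingly large transistor counts" part. Real x86 is worse than this toy because the length isn't in one field of the first byte; prefixes, opcode, ModRM and so on all feed into it.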