I think Moore's law is widely misunderstood as saying that computer chips double in speed every two years (or something to that effect). In fact, what it says is that the number of transistors on a chip doubles every two years, which is still roughly true. In 2020, it's hard to buy single-threaded CPU speed by just throwing more transistors on a chip, which is why most performance gains have been found in multithreaded workloads (because you can trivially get more CPU cores by putting more transistors on a chip).
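To make the "doubles every two years" claim concrete, here is a rough back-of-the-envelope sketch. The function name and the baseline (1 billion transistors in 2010) are made up for illustration and are not real chip data; the only thing taken from the statement above is the two-year doubling period.

```python
def projected_transistors(base_count: float, base_year: int,
                          target_year: int,
                          doubling_period_years: float = 2.0) -> float:
    """Project a transistor count forward, assuming one doubling per period."""
    doublings = (target_year - base_year) / doubling_period_years
    return base_count * 2 ** doublings


# Hypothetical baseline: 1e9 transistors in 2010 (illustrative, not a real chip).
# Ten years at one doubling per two years is five doublings, i.e. a factor of 32.
count_2020 = projected_transistors(1e9, 2010, 2020)
print(f"{count_2020:.2e}")
```

Note that this says nothing about speed: the factor of 32 is a transistor budget, and how much of it turns into single-threaded performance is a separate question.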
I would love to find out where my OS could collect information on how long my CPUs stall for lack of suitable execution ports, lack of parallelism in the code, L1-L3 cache misses, and so on.