Specifically, the crypto code now uses optimized assembly instructions, and a lot of optimizations have also gone into the ARM assembler.
One example: https://gist.github.com/carlosedp/f85274ef2a9bacc773cf8ddeed....