Hacker News new | ask | show | jobs
by marklacey 2276 days ago
Because if you statically link libraries like the language runtime it allows you to reorder the blocks in those as well to improve locality and instruction cache utilization.

For example if your code calls a particular runtime function shortly after starting, that function (or the basic blocks from that function that are executed frequently) can be placed close to the call site.

1 comments

If you're statically linking, then LTO would give you cross TU visibility to do the same optimization at link time, not post link when you've lost a lot of fidelity.
Not if it’s a library that you’re not compiling, like language runtimes or system libraries.

LTO requires that you’re compiling all the code to get the benefits.

I’m also not sure if either gcc or clang’s LTO does layout at the basic block level or at the function level. There is a lot of benefit from doing it at the basic block level. You can literally lay out all the code that runs at application start time so that it is mostly adjacent and fall-through from block to block during execution.

This isn’t something you necessarily do instead of LTO, but rather something you can do in addition to it.