Hacker News new | ask | show | jobs
by Scaevolus 1522 days ago
If you just want calls and returns, can't you use one of the other PMUs for that? Or is sampling at the "1 sample per event" level higher overhead than IPT?
1 comments

do you mean configuring the other PMUs to interrupt the core every function call / return?

If yes, then yes that is much much higher overhead than processor trace.

It's worth noting that aside from the overhead, function call / returns are not quite enough to reconstruct the callstack: tailcalls are just regular branch instructions.