Hacker News new | ask | show | jobs
by richbell 530 days ago
> I say that because, as soon as Dynatrace agents got installed on the app hosts, we started having various 'heisenbugs' requiring node restarts

Our containers regularly fail due vague LD_PRELOAD errors. Nobody has invested the time to figure out what the issue is because it usually goes away after restarting; the issue is intermittent and non-blocking, yet constant.

It's miserable.

1 comments

We do at least one rolling restart a day because it’s the best way to GC. And we’re not using any APM yet