Evolution of Our PagerDuty Playbook: Fewer Alerts, More Uptime (goshippo.com)
18 points by wjarjoui 3595 days ago
1 comments

Why is the time to acknowledge longer than the time to resolve? I would have expected the opposite.
As our system matured, we added redundancy to our core system components and split up the non-core systems. This allowed us to immediately deprovision malfunctioning servers, time out misbehaving third parties, and/or fix forward quickly, resulting in a really low TTR compared to our TTA (which has also been pretty low in recent months).
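
To illustrate the "time out misbehaving third parties" part, here is a minimal Python sketch, not our actual code. The helper name `call_with_timeout`, the fallback value, and the timeout values are all hypothetical; the idea is just that a slow dependency gets cut off after a fixed budget instead of holding up the request.

```python
import concurrent.futures
import time


def call_with_timeout(fn, timeout_s, fallback):
    """Run fn() in a worker thread and return its result.

    If fn() has not finished within timeout_s seconds, return fallback
    instead. Hypothetical helper for illustration only. Note that the
    worker thread is not killed; it keeps running in the background.
    """
    pool = concurrent.futures.ThreadPoolExecutor(max_workers=1)
    future = pool.submit(fn)
    try:
        return future.result(timeout=timeout_s)
    except concurrent.futures.TimeoutError:
        return fallback
    finally:
        # Do not block the caller waiting for the slow call to finish.
        pool.shutdown(wait=False)


def slow_third_party():
    # Stand-in for a misbehaving external service (assumption).
    time.sleep(0.5)
    return "late response"


if __name__ == "__main__":
    print(call_with_timeout(lambda: "fast response", 1.0, "fallback"))
    print(call_with_timeout(slow_third_party, 0.05, "fallback"))
```

A real setup would more likely set timeouts on the HTTP client itself, since a thread-based wrapper like this leaves the slow call running until it returns on its own.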