Hacker News new | ask | show | jobs
by adrtessier 3843 days ago
I think generally when every alarm bell in your monitoring system goes off the first thing you do is question whether monitoring is broken. When you confirm there is a problem this big, you panic and try to fix it really fast. Then you call your other on-call guys and tell them you actually have an "oh, shit" situation.

Once you recognize there's a serious problem, THEN you make the public announcement. Ah, the life of ops.