by Johnny555
745 days ago
> they were playing whack-a-mole with all the alarms caused by the flood

That's common in computer monitoring systems. At my last job, when we had a serious outage, we'd get dozens of pager alerts, and it was hard to figure out the root cause because so many of the alerts that fired were just symptoms of it. For example, if the root cause was a root volume running out of disk space, the "unable to log in" alert was superfluous and not helpful. Eventually we moved to a better system that had a better sense of hierarchy for alerts as well as a way to easily silence them.
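To make the hierarchy idea concrete, here is a minimal sketch of how it could work, not the system we actually used. The alert names, the `PARENT` mapping, and the `alerts_to_page` function are all made up for illustration, and it assumes the hierarchy has no cycles.

```python
from typing import AbstractSet, Dict, Optional, Set

# Hypothetical hierarchy: each alert maps to the alert it depends on
# (None means it is a top-level alert). Assumed to contain no cycles.
PARENT: Dict[str, Optional[str]] = {
    "root_volume_full": None,
    "unable_to_log_in": "root_volume_full",
    "cron_jobs_failing": "root_volume_full",
    "cert_expiring": None,
}


def alerts_to_page(
    firing: AbstractSet[str],
    silenced: AbstractSet[str] = frozenset(),
) -> Set[str]:
    """Return the firing alerts worth paging on.

    An alert is dropped if it was manually silenced, or if any of its
    ancestors in the hierarchy is also firing (it is then treated as a
    symptom rather than a cause).
    """
    result: Set[str] = set()
    for alert in firing:
        if alert in silenced:
            continue
        # Walk up the hierarchy looking for a firing ancestor.
        ancestor = PARENT.get(alert)
        explained = False
        while ancestor is not None:
            if ancestor in firing:
                explained = True
                break
            ancestor = PARENT.get(ancestor)
        if not explained:
            result.add(alert)
    return result


if __name__ == "__main__":
    firing = {"root_volume_full", "unable_to_log_in", "cron_jobs_failing"}
    print(alerts_to_page(firing))
```

With the disk-full example, only `root_volume_full` would page; if `unable_to_log_in` fires on its own, it still pages because nothing above it explains it.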