Hacker News new | ask | show | jobs
by bak3y 93 days ago
Exactly what I came to say, alerts need tuning if you're having to check your monitoring tools by hand.
1 comments

I read the article as a way for AI to check, classify and potentially partial fix the alerts you see when logging-in in the morning.

And for many alerts you need to look at other events around it to properly classify and partially solve them. Due to that you need to give the AI more then just the alerts.

Through I do see a risk similar to wrongly tuned alerts:

Not everything which resolves by itself and can be ignored _in this moment_ is a non issue. It's e.g. pretty common that a system with same rare ignoble warns/errs falls completely flat, when on-boarding a lot of users, introducing a new high load feature, etc. due the exactly the things which you could fully ignore before hand.