Hacker News new | ask | show | jobs
by cddotdotslash 311 days ago
Companies should automate this. Write their own outage monitoring, feed the results, plus the cumbersome format you have to send to the provider, into an LLM, have it spit out an email requesting SLA credits or whatever the contract specifies.

Probably not worth it for low cost services, but if you’re paying GitHub $x millions per year, maybe it is.

3 comments

They intentionally underreport outages. Everybody does. When your performance metrics for your customers, managers, and individual contributors all include uptime, what you get isn't better uptime but lies about uptime.
Some customers of my product, StatusGator, do this with our API. They can extract the outage data -- including the time when we detect the outage before its acknowledged. And then use that to get SLA credits.
Why would I trust you to report

Its great that your specific product does this, but as a whole I have to monitor the service separately to keep you honest (well not you specifically, I'm sure you are honest and do as much as you can to be honest, but not every company is), and of course to monitor the problems I have which you don't detect.

Gotta get a monitor for the monitor, then a manger for the monitors, and a manager to manage the managers of monitors.
I think that was sarcasm
Obviously you should use a SaaS for SaaS uptime monitoring. No need to build this yourself.
You can also self-host something like Prometheus or Uptime-Kuma.