| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by pythux 805 days ago

The status page says it's been resolved 8 minutes ago (Apr 05, 2024 - 08:48 UTC): https://www.githubstatus.com/incidents/bnkkbj90yhz6

But it definitely still happens now (500s on refresh on PRs and GitHub actions)

Edit: still ongoing

Edit 2: still ongoing at Apr 05, 2024 - 08:56 UTC (keeping updated for the record since their status page cannot be trusted apparently)

Edit 3: I see they have switched to a different (ongoing) incident ID now: https://www.githubstatus.com/incidents/5ly0psff2s5d

2 comments

s_dev 805 days ago

Status pages can never be trusted.

link

johnchristopher 805 days ago

I recently set up a status page for the services I run on my pi. The idea was to get some insights and apply experience at work.

My experience now tells me what we really need first is a solid alerting system, the status page can't be trusted. It's a PR tool (a useful one), not a sysadmin tool.

link

MichaelZuo 805 days ago

If the status page officially authorized by the company’s management team cannot be trusted, then why trust the company in the first place?

link

johnchristopher 804 days ago

Is the company in the business of selling status pages ?

link

MichaelZuo 802 days ago

If they market it as a feature，yes?

Were you expecting me to say something else?

link

johnchristopher 802 days ago

> If they market it as a feature，yes?

Well, the status page is not the main feature of github/gitlab but I agree that if customers decide to rely on it then it's a problem.

> Were you expecting me to say something else?

Yeah, something like "their status page is not their core business" or "status pages and SLA are two distinct things".

link

joelanman 805 days ago

I always think this should be an incident in itself - why did our status page not reflect the reality of a degraded service? It's so common that they don't, and something user-driven like DownDetector is often more reliable

link

hashworks 805 days ago

I don't think an accurate and automated public status page is something any management would want. If it was accurate they wouldn't be able to lie to customers about the uptime. So I always suspect status pages are adjusted manually.

link

s_dev 805 days ago

That's exactly what happens. How we need to respond though is by not linking to status pages hosted by that party, instead we should be linking to a StatusGator or DownDetector page as a 'source of truth'.

link

omeid2 805 days ago

> why did our status page not reflect the reality of a degraded service?

There was a conflict with marketing, market movement, sla contracts, and our image.

link

compumike 805 days ago

What about something like https://heiioncall.com/status (disclosure: helped build it) which gives a real-time view into what our monitoring & alerting system sees, both from various HTTP endpoint checks, and cronjob checkins?

But I don’t think most orgs would want a public version of this (too much transparency), which is why we haven’t built that.

link

jll29 805 days ago

=> Third-party sites can never be trusted.

link

jorge-d 805 days ago

=> Nobody can never be trusted

link

tux3 805 days ago

=> I think, therefore I am. Everything else is speculation.

link

eloeffler 805 days ago