Hacker News new | ask | show | jobs
by mcharezinski 2216 days ago
Shooting ourselves in the foot was the number one cause of downtime.

What do you mean?

1 comments

The biggest cause of downtime was deployment. Either deploying code with a bug that wasn't caught in testing, or changing a real-time configuration parameter that wasn't properly scoped.

As opposed to a scaling issue that showed up later, or a node failure or all the other things that could cause downtime.