Hacker News new | ask | show | jobs
by dmlittle 236 days ago
The node failure rate is much higher than that. On a 1M node cluster of cloud-managed instances (AWS, GCP, Azure, etc.) you'd likely see failures a few times a month, if not more.
1 comments

Yep. And the chances that the DB node with the control plane fails are therefore less than one in ten thousand.