Hacker News new | ask | show | jobs
by andrewcooke 5126 days ago
true, but it sounds like they (and perhaps you) have never even heard of the chaos monkey.
1 comments

Randomly killing instances wouldn't have detected this particular failure mode as far as I can see, since the error lay in the inability to resurrect a failed process under certain circumstances.