|
|
|
|
|
by hinkley
461 days ago
|
|
> In one particular case at Google, a software controller–acting on bad feedback from another software system–determined that it should issue an unsafe control action. It scheduled this action to happen after 30 days. Even though there were indicators that this unsafe action was going to occur, no software engineers–humans–were actually monitoring the indicators. So, after 30 days, the unsafe control action occurred, resulting in an outage. Isn't this the time they accidentally deleted governmental databases? I love the attempt at blameless generalization, but wow. |
|
https://cloud.google.com/blog/products/infrastructure/detail...