The point is that the monkey was only the trigger. But the full cause of an outage is much more than just the trigger. It's a monkey this time, could have been an operator or a storm...
Digging a bit deeper, they appear to be running with very little margin, if any. Meaning redundancy and protection against cascading failures is limited.
Fixing the real cause is expensive, so it's often easier to just blame the trigger—though admittedly in this case it makes for a great headline. :)
OK, in all fairness I did think all we were talking about was the trigger, not the underlying vulnerable system (which I already had understood to be less than solid over the years.)
Digging a bit deeper, they appear to be running with very little margin, if any. Meaning redundancy and protection against cascading failures is limited.
Fixing the real cause is expensive, so it's often easier to just blame the trigger—though admittedly in this case it makes for a great headline. :)