|
|
|
|
|
by athenot
2315 days ago
|
|
No, there's a lot more that goes into handling incidents that affect large production systems. Getting things back up as fast as possible, coordination, communication, getting the right action items out of it. There are tradeoff decisions that need to be made, executives and big customers picking up the phone. This kind of tooling is what arises in an effort to automate and streamline incident response. When you're operating at Netflix' scale, each minute is precious and if a tool manages to save 45 seconds on each incident, it can be quite valuable. |
|
https://rt-wiki.bestpractical.com/wiki/WorkFlow#Modeling_Wor...
I don't have a problem with the work -- like I said, it's a persistent use-case -- it's just the way it's described here, as if it wasn't and with puffery. And the thin-ness of my skin with this is not the issue!