Hacker News new | ask | show | jobs
by virtuallyvivek 2348 days ago
Yeah... the idea was that urgent meant a site outage or customer P0. I'd like to think that would qualify, no?
1 comments

We've heard this many times when people hear we have deep focus scheduled: "You'd better respond or you'll lose customers". But we use teams. The team responds. There's always many more than one person on-call and/or available to respond. If one person is unavailable we have others from the team able to step in.

An outage that can only be resolved using a single hero individual that must always be available to swoop in to save the day means there was no team backing them up. That seems less effective: It means no one is really taking leave. No one is taking real breaks. No one is doing real handovers either on a regular basis.

The "no real handovers" aspect means that the current state of each system isn't being checkpointed. We see that as a business risk. Not unlike what source code control protects against. Imagine you didn't commit code into git or similar for awhile. Still feel safe about rolling back changes? How are you going to roll changes back? Its a similar thing in people. If you don't do any handovers then stuff stays in one individual's head.

Your situation might be different. That's fine.