| > assumptions about always-on digital services And maybe also always-on humans, which some companies seem to ridiculously expect. I really don't understand this obsession with 24/7 uptime for non-critical systems. Requiring your engineers to be always on-call and debug something at 3am is a health hazard and should be treated like one. If a photo-sharing app is down at 3am, I'm sure the users can go to sleep and wait till 10am. This isn't some oxygen life support system. If you have that many users in multiple time zones, then hire people in multiple time zones. Even if TurboTax crashes on 4/15 at 11:35pm and the engineers don't fix it until the next workday, resulting in millions of people not being able to file their taxes, I'm sure the IRS might grumble a lot but would give people an extension. It'll all be good, and everyone will get to sleep . |
That's way too big of a risk, and way too much stress to put on your customers.
For something like tax software, you should have people on call, or even 24/7 staffing, for that specific week. 2% of the year.
In general, big release dates or important deadline should often have extra resources. 0-10 days per year. Pay extra for the health hazard, but that doesn't mean don't do it.