|
|
|
|
|
by pommers
2751 days ago
|
|
My current teams on call is pretty taxing. Two weeks on, two weeks off (only myself and my tech lead in the team at the moment), but our alerting is pretty good. The places it falls down are where we interface with other teams who aren't on call for their systems and for them a weekend long outage is "acceptable". |
|
I suggest you look at the on-call chapters in the SRE book, SRE Workbook, and Seeking SRE.
The solution is primarily to include the development team in the on-call rotation (you build it - you run it). This can be very hard to do politically.