|
|
|
|
|
by ac2u
1100 days ago
|
|
Some of these need to be fixed at a higher level. On call monitoring responsibilities for a certain time period should be separate from resolution duties. In other words, aside for some well defined ops issues that have clear runbooks, the role of the person monitoring should be find out or know who to escalate to, not resolve. It's actually a great onboarding activity as it exposes new staff members to parts of the infra and operations that their managers/peers might have neglected to mention. The second way to alleviate the issues is to pair a person such as yourself with a person that has a lot of institutional knowledge so that you can triage together, learn from them, and update the docs so the organisation as a whole has better resources. Eventually, the percentage of incidents where you don't have the institutional knowledge to know how to proceed will decrease to the point where it's mostly safe for you to do on-call on your own. Then eventually you become the experienced on-call person that gets paired with the new employees that are gaining that institutional knowledge. |
|