| Hello HN. I'd like to pick your collective brain. I have 18 years of experience as a software engineer, and I'm considering a role inside Google's SRE (Site Reliability Engineering) organization. I have very little operational experience; my career so far (and certainly in recent history) has been almost pure development. I'm wondering if anybody could share their experiences from inside the Google SRE group. 1. For people who went from being a pure developer to a SRE, how hard was the adjustment? 2. How do you feel about being on-call? 3. How much will my lack of operational experience hurt me? 4. What is the balance between operational work and project work? 5. For people who decided to leave the SRE group, why did you decide to leave? 6. Any regrets? I have another potential match with a non-SRE team, and I'm weighing both options. Thank you in advance for any information or advice. It's a big decision. |
1. It was an interesting adjustment. The work is qualitatively different but it's still software engineering. Just at a higher level.
2. On-call is the best I've ever had at any company. It's 12h a day max and overtime is compensated. You won't be woken up in the middle of the night. There's different tiers as well. I'm tier 1 so that means a five minute response time. That's too much for some people and I don't blame them.
3. My team is ramping up people straight out of university. You'll be well trained and have half a year or longer depending on how critical the service is.
4. In my org it's something like 25% of your time on-call at most. Depending on the size of your team it's less.
5. Not applicable. My first SRE job.
6. None so far. Google is very nice to work for. Like all jobs it'll be stressful at times but it's better here than anywhere else I've worked.