Hacker News new | ask | show | jobs
by closeparen 2884 days ago
I have to disagree. The typical and intuitive ways of reasoning about outages and outage risk - screaming at the engineers until they fix it, desperately passing the buck, finding someone to fire in the aftermath - are not a good fit for any context. Every company can benefit from a more principled mental model of system reliability.
1 comments

If your company's management doesn't even know what an SRE is, then you're stuck in the same exact place, where the SREs are the one being screamed at instead. Some companies just rename "devops" to "SRE".
I think the renaming is fine as long as it also comes with the responsibility of driving the tracking and improving of site reliability :)
I just renamed my microwave to "refrigerator", but all of my food caught on fire and started leaking operational debt! :(