by SuperQue 1412 days ago
2. Treat reliability as if it is its own feature.

It's also "Treat reliability like a software engineering problem, not a process/operations problem".

1 comment

I don't know where you were working, but in my professional career process/ops problems have always been "automation opportunities".

That said, most of the effort went into making the initial designs robust, with as few moving parts as possible; sometimes the automation efforts caused more outages than the dead-simple operational problems did.

(see also: split-brain with pacemaker/corosync on replicated databases)