Hacker News new | ask | show | jobs
by merrua 3125 days ago
People like the book 'Site Reliability Engineering: How Google Runs Production Systems'. You could take a look at Netflix's info and tools. They break their system themselves to ensure it recovers well. It would help if you indicated which meaning of resilient you use?