Hacker News new | ask | show | jobs
by jack6e 2924 days ago
At PyCon in Cleveland this year Amjith Ramanujam did a presentation on "how Netflix does failovers in 7 minutes flat" [0]. Worth a watch/listen for anyone interested in what their response may have looked like. Now I'm curious to read a post-mortem from them and see whether their procedures worked as expected (it sounds like they were down longer than 7 minutes) or where they encountered unexpected issues.

[0] https://www.youtube.com/watch?v=iQI56-up3Yk

1 comments

Is 7 minutes good? If you have a multimaster redundant systems, you don't need failover.
You write that as to indicate it is easy do. It's more than "a multimaster db" to Netflix, it's CDNs, authentication, logging, security...