by reverseblade2 162 days ago
This roadmap focuses on:

triage before diagnosis

when dashboards lie

why doing nothing is sometimes correct

partial failures and cascading effects

humans under stress

turning incidents into better architecture