Hacker News new | ask | show | jobs
by l0k3ndr 1757 days ago
Nice read. Just curious that isn't this handled via rate throttling at load-balancer/gateway level? Like only forward requests to database as much as it can handle?
2 comments

That introduces a "mode"? In fact load-balancers / traffic-shaping themselves can misoperate often resulting in catastrophic consequences.

See chapters 19 to 22: https://sre.google/sre-book/load-balancing-frontend/

I like the idea of priority queues driven by number of prior successful requests + wait time. Such that once you're in, you get reasonable performance, otherwise you get 503 "too busy" until you've waited.