|
|
|
|
|
by stormbeard
645 days ago
|
|
From what I've seen, there's usually some hard requirement that a single instance of the application can handle N in-flight requests. This is fine when you have a single load balancer that uses some flavor of "least request" or "least connection" balancing algorithm. The problem arises when you have a fleet of load balancers- they don't share state, so each LB can't know the state of any individual backend without sending a request. This results in a failed request if the backend is already at full capacity. Your tone is coming off as condescending and I'm not sure if it's intentional. I guess I should mention that I'm _very_ familiar with how load balancing works. This is a real problem and you can't "tune" your way into a consistent global view of the backend states. |
|
Pre-modern computing, load balancing was a telecommunications field thing. Cloud computing is the modern 'load balancing' take.