by cnivolle
4457 days ago
Our servers are based near Paris, in two Tier 4+ DCs.
But we are pretty close to opening a point of presence in North America :)
Load is computed on two metrics, CPU and RAM: a scaling event is triggered when either of these metrics is above 85%.
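To make the rule concrete, here is a minimal sketch of the trigger as stated above. The function name and the 0.0-1.0 representation of load are assumptions made for illustration, not the actual implementation; how the metrics are sampled or averaged before the comparison is not specified here.

```python
# Hypothetical sketch of the scaling trigger described above.
# Only the 85% threshold and the two metrics (CPU, RAM) come from the comment;
# everything else (names, 0.0-1.0 fractions) is an assumption.

THRESHOLD = 0.85  # 85%


def should_scale_up(cpu: float, ram: float, threshold: float = THRESHOLD) -> bool:
    """Return True when either metric is strictly above the threshold."""
    return cpu > threshold or ram > threshold


if __name__ == "__main__":
    print(should_scale_up(cpu=0.90, ram=0.40))  # CPU alone is enough to trigger
    print(should_scale_up(cpu=0.50, ram=0.50))  # neither metric is above 85%
```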
|
I have a thousand questions, but I guess I'll just focus on these:
* Over what time window are the load metrics computed? How long do they have to stay above 85% to trigger a scale?
* Do you compute the average load across all scalers? Or some percentile?
* Can these parameters be tuned? Can the load and the scaling decisions be seen (apart from on the bill)?
* With other bottlenecks (like IO), you won't notice high CPU or RAM load, but you would notice a spike in response time. Do you allow, or have any plans to allow, other metrics in the scaler, or let customers export custom metrics?
* Do you allow spawning long-running background processes? If so, how do they interact with the CPU-load-based scaling?
ADDED:
* Can the user add placement constraints when deploying an app in multiple DCs? Can the scalers themselves span multiple DCs in the same region? If not, do you plan to offer access to load balancer configuration, so that traffic can be served to multiple app instances acting as a single logical app?