| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by pythonwutang 2471 days ago

> Therefore, the best thing to do is to deploy your program without resource limits, observe how your program behaves during idle/ regular and peak loads, and set requested/ limit resources based on the observed values.

This is one of the author’s fatal assumption. The best practice I understand is to set cpu requests to be around 80% of peak and limits to 120% of peak before deploying to prod.

They set themselves up for disaster with this architecture where they have many idle pods polling for resource availability. This resource monitoring should have been delegated to a single pod.

Also it’s really unclear what specific strategy led to extra costs of 1000s of dollars...