|
|
|
|
|
by moltar
815 days ago
|
|
Yes, but this is truly an exceptional case. Their workloads are basically scraping (crawling) at a massive scale. Just like Google does, it makes more sense to have cheap throw-away hardware for this use case. There are no permission issues or ACLs. There’s no need to auto scale and the traffic is very predictable. There is no serious need to orchestrate deployments. I imagine it’s mostly just workers reading URLs from a queue and crawling a page. So very easy to deploy new servers. This is just an edge case scenario specifically great for self hosting. |
|
What is easier, having a bunch of powerful servers that provides you enough headroom or having to fight your auto-scaling group to have just enough capacity and in the end of the day still costing more?