Hacker News new | ask | show | jobs
by phamilton 2379 days ago
Datapoint: We have 2 "DevOps guys" supporting a significant AWS infrastructure. We autoscale from 200 ec2 instances at night to 700 ec2 instances during the day. We run 60+ microservices, each of which has multiple processes that run, each of which is autoscaled (we use ECS). We use Aurora (with autoscaled readers) and DynamoDB (autoscaled IOPS). We manage all of that with 2 "Devops Guys".

Granted, we're a mature startup and have put a few years of investment (at the cost of 2-3 "Devops Guys") into our infra, but ultimately it doesn't take much to manage a ton of AWS infra once the tooling is in place.

2 comments

Man, that sounds so luxurious. I'm begging for us to hire a second guy because I'd like to not always be on point for everything and to take vacations. Probably running an order of magnitude more stuff than you described, multi-cloud and with Terraform.

Terraform and the fact that I came in with experience makes this doable. But only just.

just for back of the envelope, who many customers are you able to support per ec2 instance?
WhatsApp used to be hosted on ~15 bare metal servers serving 100 million concurrent users...
They were also acquired at a price which would value each employee at ~350M.

They were capable of scaling in a way that is certainly an anomaly, and not indicative of the costs of an ordinary team.

It speaks volumes about what the right talent and architecture/technology choices can do if leveraged successfully, but is more of an interesting anecdote than a realistic infrastructure budget.

> They were also acquired at a price which would value each employee at ~350M.

That’s a pointless calculation. The acquisition wasn’t for the employees. As with all network-effects products, the acquisition was for the active user base. They could have acquired WhatsApp, fired the engineering team, rewrote it with an architecture that required 100x the servers and still been happy.

It speaks volumes about Erlang/BEAM I think
I find this so fascinating, is there more info on the software/hardware during this time period?
Before ist was possible to Share Images. BTW they used s3 dir that
We have give or take 30M monthly active users.

The instances we use are not the largest (we use 2xls) but we also incorporate spot instances as part of our autoscaling.