|
|
|
|
|
by lifeisstillgood
4205 days ago
|
|
20 million different time series.
I mean that is a lot. If you have say, 20,000 servers running that is still
1,000 different time series per server. Memory, CPU, logins, logouts, customer selections, I mean I struggle to get to those numbers. |
|
But this of course just makes the question more reasonable -- 1.2B different time series? Really?
Yup. We get a bunch of system telemetry, and a bunch of default application telemetry, without even getting traffic hitting the box, but that's a relatively small percentage of the overall volume. Developers LOVE metrics.
So imagine you want to measure requests to our API, and these are some tags you want to keep track of: request type: 5 different types result: 2 possible values (success, failure) originating country: 50 countries originating device type: 200 devices
And let's say you've got a 1000 instances reporting this data.
Suddenly you've got 5 * 2 * 50 * 200 * 1000
Oh look. Here's 100M different metrics.
And that's a relatively trivial example.