by robinhoodexe
559 days ago
We’re running standard Prometheus on Kubernetes (14 on-prem Talos clusters, with a total of 191 nodes, 1.1k CPU cores, 4.75 TiB of memory and 4k pods). We use Thanos to store metrics in self-hosted S3 (SeaweedFS) with 30 days of retention, and downsample aggressively after 3 days. It works pretty well tbh. I’m excited about upgrading to version 3, as it does take a lot of resources to keep going, especially on clusters with a lot of pods being spawned all the time.

We've been using Talos for our internal clusters for a while, but only with quite small ones (3 kube nodes, 5 worker nodes).
Upgrading has generally been a non-event, and we're quite happy with them.
How do you deploy Thanos? In one of the clusters?