Hacker News new | ask | show | jobs
by tedk-42 2585 days ago
Running Elasticsearch on K8s storing 16TB of compressed logs across 6 data nodes and ~4600 shards.

We're a really happy ES customer. We've on ESv6 at the moment and it's been running amazingly for us. We've halved our storage and running costs by moving from 5 to 6.

We've always been a licensed customer and they are in front of AWS with their features (we run our k8s stack on AWS though :) )

3 comments

Some free advice: reduce the number of shards! Each shard comes with some state management overhead.

The soft limit is currently at 1000 shards per node, but you should be aiming at 25-50GB of data per shard.

How did ES 6 reduce your operation costs by 50%? Same CPU, disk & network bandwidth?
That's an insane number of shards, you should be closer to 500 for that amount of data and only 6 nodes.
It's due to the number of indicies/indexes stored from our various data sources. Yours and another poster's comments are interesting so we might look at ways we can reduce the shard count given the new info on overhead.