|
|
|
|
|
by gt565k
2617 days ago
|
|
Oh fuck me, I didn't even realize they used a single instance (node). To expand a little bit, the whole point of using multiple shards per index in an ES cluster is so that the shards spread across multiple nodes (servers) and distribute the load (disk i/o) and handle redundancy. ES automatically scales and reshuffles its shards across multiple nodes in the cluster to handle fault-tolerance as well. If one or more nodes go down, the cluster still has all of the data through replica shards etc... Either way, in this particular case, the data is so small, having 5 shards per index with 50k indices results in 250k shards for 5GBs of data. 5GB / 250k shards = 20kb per shard. You have shards of size ~ 20kb ... total cluster misconfiguration. |
|