by Zombieball
4970 days ago
Not sure about codewright's use cases. However, in my own brief experimentation with Solr I ran into performance issues with garbage collection. I set up a cluster of about 15 cc2.8xlarge machines (5 shards with 3 replicas each) containing 240 GB worth of documents (48 GB per shard). Each node was given on the order of 40 GB of heap space. While performing load tests with a relatively small load (~150 QPS), after a few minutes the garbage collector on the nodes would kick in and run for on the order of 15 to 30 seconds. This had a cascading effect: ZooKeeper would think nodes were down, start leader re-election, etc. Admittedly, I am quite inexperienced when it comes to dealing with applications using such large heap sizes. Though I tried a few different JVM options with respect to GC, I was unsuccessful in resolving the problem. If any folks here happen to have some good resources regarding GC and large Solr clusters, I would definitely be interested.
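For anyone trying to follow the numbers, here is a rough sketch of the layout arithmetic and of why a long pause cascades. It is only an illustration: the function names are made up, and the 15-second session timeout is an assumed value for the example, not something stated in the comment above. The idea is that a stop-the-world pause at least as long as the ZooKeeper session timeout means the node sends no heartbeats for that whole window, so its session can expire and the node looks dead.

```python
# Hypothetical helpers for illustration only; none of these names come from Solr.

# Assumed ZooKeeper session timeout in seconds (not given in the post).
ASSUMED_SESSION_TIMEOUT_S = 15.0


def shard_layout(total_gb, shards, replicas):
    """Return (node count, index size per shard in GB) for an even split."""
    return shards * replicas, total_gb / shards


def pause_can_expire_session(pause_s, session_timeout_s=ASSUMED_SESSION_TIMEOUT_S):
    """True if a GC pause is long enough that no heartbeat reaches
    ZooKeeper within the session timeout."""
    return pause_s >= session_timeout_s


nodes, gb_per_shard = shard_layout(total_gb=240, shards=5, replicas=3)
print(nodes, gb_per_shard)

# Pauses reported in the post were on the order of 15 to 30 seconds.
for pause in (2, 15, 30):
    print(pause, pause_can_expire_session(pause))
```

Under that assumed timeout, the reported 15 to 30 second pauses are all long enough to drop a session, which would match the leader re-election behavior described.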
Try it again with sane GC parameters, e.g.:
Where <N> is a value between 2 and 8.
Edit: I was benchmarking a similarly sized (though very differently configured) Solr cluster for a well-known internet company, and was able to tune it to do 5000 QPS, with p50 ~2 ms and p99 ~20 ms.