|
|
|
|
|
by Plough_Jogger
1424 days ago
|
|
My sense from the few projects I've seen attempt to use Druid is that there is quite a lot of infrastructure overhead / DevOps support required to manage a cluster at scale, and that fairly complex ingestion pipelines are required to load the data in the right format. Anecdotally, I've heard that ClickHouse is easier to deploy from this perspective with similar performance, but would love to get others views / experience with these and similar data stores. |
|
Especially since ingestion goes straight to S3. We don’t really worry about backups (just deal with PG backups).
Just make sure your ZK is happy and all will be well.
The hard part about Druid is tuning:
- the ingestions: Spec definition, compaction, sharding strategy, RAM consumptions, etc.
- and query performance: RAM consumptions, number of threads, timeouts, etc.