by nrjames
601 days ago
I work in gaming and stream events into a self-hosted ClickHouse database without Kafka. We just use the ClickHouse Python connector and send records in batches of 100K, using ReplacingMergeTree for backfills, etc. It works very well. Unless you truly need up-to-the-minute analytics, it’s super easy to schedule with Dagster or Airflow or whatever. We process 100M+ events per day this way.
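
For anyone wondering what that looks like in practice, here is a rough sketch of the batching approach described above. It is not the poster's actual code. It assumes the `clickhouse-connect` package is the connector in question (the comment doesn't say which one), and the host, table name, and columns are all made up for illustration.

```python
from datetime import datetime
from itertools import islice
from typing import Any, Iterable, Iterator, List, Sequence

BATCH_SIZE = 100_000  # the batch size mentioned in the comment


def batched(rows: Iterable[Sequence[Any]], size: int) -> Iterator[List[Sequence[Any]]]:
    """Yield lists of at most `size` rows until `rows` is exhausted."""
    it = iter(rows)
    while batch := list(islice(it, size)):
        yield batch


def insert_in_batches(client, table: str, rows: Iterable[Sequence[Any]],
                      column_names: Sequence[str],
                      batch_size: int = BATCH_SIZE) -> int:
    """Send rows in fixed-size batches and return the number of rows sent.

    `client` is anything with an `insert(table, data, column_names=...)`
    method, which is the shape of clickhouse-connect's client.
    """
    total = 0
    for batch in batched(rows, batch_size):
        client.insert(table, batch, column_names=column_names)
        total += len(batch)
    return total


def run_example() -> None:
    """Hypothetical end-to-end run; needs a reachable ClickHouse server."""
    import clickhouse_connect  # pip install clickhouse-connect

    client = clickhouse_connect.get_client(host="localhost")  # assumed host
    # Assumed schema. ReplacingMergeTree collapses rows that share the same
    # ORDER BY key during background merges, so re-running a backfill does
    # not leave permanent duplicates.
    client.command(
        "CREATE TABLE IF NOT EXISTS events ("
        "user_id UInt64, event_type String, ts DateTime"
        ") ENGINE = ReplacingMergeTree ORDER BY (user_id, event_type, ts)"
    )
    events = [(1, "login", datetime(2024, 1, 1, 0, 0, 0))]
    insert_in_batches(client, "events", events, ["user_id", "event_type", "ts"])
```

A scheduler such as Dagster or Airflow would call something like `run_example()` on an interval. Note that ReplacingMergeTree deduplicates only when parts are merged, so queries that need exact results right after a backfill may have to account for not-yet-merged duplicates.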
|
It's also kind of a bummer that records have to be inserted in batches, when the tagline on ClickHouse's website is:
> Build real-time data products that scale
But, thanks for the clarification!