|
|
|
|
|
by r0ckysharma
1171 days ago
|
|
1. 80,000 - 125,000 ( on peak per minute ) - expecting 5-6 times in increase in coming few months. 2. Size of each insert ( approx 1 KB ) 3. Year end datasize = Not available ( too early to guess, but average 600-700 GB ) 4. Query must finish in around a minute or around. |
|
The biggest problem you're going to face is ingestion of these events during peaks at 500k events per minute. You can't ingest them individually into Clickhouse or most other databases. So unfortunately you will have to add one additional streaming layer to cache these events so you can create batches of events once every few seconds and ingest a big batch of 1k-10k events into Clickhouse. AWS API Gateway + Kinesis is operationally easy to set up and quite cheap and should be able to handle your peak load. Afterwards use a Lamda to batch >1000 events from Kinesis and insert into Clickhouse. I've never tested this last part so I'm not sure how it will work out.
It'd be nice to know what you eventually go with. Please send me a message if you can of what you've finally chosen.