Hacker News new | ask | show | jobs
by stephankoelle 722 days ago
I can confirm that. 10tb uncompressed, 3tb compressed in clickhouse, 60 billion Rows, queries are fast.
1 comments

Really curious what "fast" is in this context. Milliseconds, seconds, minutes?
Full-range query with aggregations is around 40-60 seconds. Can be optimized, of course.
Base on numbers you put, you would need to have stable 50-100GB/s speed to your disc system to just read 3TB data in 40-60sec. Would you mind sharing your setup?
That's only if you read everything, and if none is in ram. With a columnar DB you can just scan the values of the column you need.
Then 3tb of total data volume is irrelevant. What relevant is size of column.
How about many-to-many comparisons? Interested in this use-case for fuzzy matching of many data records using a local db.