Hacker News
by
atemerev
730 days ago
Full-range query with aggregations is around 40-60 seconds. Can be optimized, of course.
2 comments
riku_iki
730 days ago
Based on the numbers you gave, you would need a sustained 50-100 GB/s from your disk system just to read 3 TB of data in 40-60 sec. Would you mind sharing your setup?
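For reference, the back-of-the-envelope arithmetic can be sketched as below. It assumes decimal units (1 TB = 1000 GB) and that the full 3 TB is read from disk with nothing cached; the function name is just illustrative.

```python
def required_throughput_gb_s(data_tb: float, seconds: float) -> float:
    """Sustained read rate in GB/s needed to read `data_tb` terabytes in `seconds`."""
    # Assumption: decimal units, 1 TB = 1000 GB.
    return data_tb * 1000 / seconds


if __name__ == "__main__":
    # Query times quoted upthread: 40-60 seconds over 3 TB.
    for secs in (40, 60):
        print(f"{secs} s -> {required_throughput_gb_s(3, secs):.0f} GB/s")
```

Under those assumptions the floor is about 50 GB/s for a 60-second query and 75 GB/s for a 40-second one.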
winrid
730 days ago
That's only if you read everything and none of it is in RAM. With a columnar DB you can scan just the values of the column you need.
riku_iki
730 days ago
Then the 3 TB total data volume is irrelevant. What's relevant is the size of the column.
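To make the column-size point concrete, here is a rough sketch. The column counts are hypothetical (nobody in the thread gave a schema), and it assumes equal-width columns, no compression, and nothing cached in RAM.

```python
def scanned_gb(total_tb: float, total_columns: int, columns_read: int) -> float:
    """GB a columnar scan touches, assuming all columns are the same size on disk."""
    # Assumption: decimal units (1 TB = 1000 GB), no compression, no caching.
    return total_tb * 1000 * columns_read / total_columns


if __name__ == "__main__":
    # Hypothetical table: 3 TB spread over 50 columns, query reads 2 of them.
    gb = scanned_gb(3, 50, 2)
    for secs in (40, 60):
        print(f"{gb:.0f} GB in {secs} s -> {gb / secs:.1f} GB/s")
```

With those made-up numbers the query scans 120 GB rather than 3 TB, so the required read rate drops to 2-3 GB/s.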
dleeftink
730 days ago
How about many-to-many comparisons? Interested in this use-case for fuzzy matching of many data records using a local db.