Hacker News
by
ZephyrBlu
722 days ago
Really curious what "fast" is in this context. Milliseconds, seconds, minutes?
atemerev
722 days ago
Full-range query with aggregations is around 40-60 seconds. Can be optimized, of course.
riku_iki
722 days ago
Based on the numbers you gave, you would need a sustained 50-75 GB/s from your disk system just to read 3 TB of data in 40-60 seconds. Would you mind sharing your setup?
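For anyone who wants to check the arithmetic, here is a rough sketch. It assumes decimal terabytes (3 TB = 3e12 bytes) and a cold read of every byte; the function name is just for illustration.

```python
def required_throughput_gb_per_s(data_bytes: float, seconds: float) -> float:
    """Sustained read rate (GB/s) needed to read data_bytes in the given time."""
    return data_bytes / seconds / 1e9

# Assumption: decimal units, 3 TB = 3e12 bytes, nothing cached in RAM.
TOTAL_BYTES = 3e12

slowest = required_throughput_gb_per_s(TOTAL_BYTES, 60)  # 60-second query
fastest = required_throughput_gb_per_s(TOTAL_BYTES, 40)  # 40-second query

print(f"60 s: {slowest:.0f} GB/s, 40 s: {fastest:.0f} GB/s")
```

So a full cold scan would need somewhere between 50 and 75 GB/s of sustained read bandwidth.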
winrid
722 days ago
That's only if you read everything and none of it is in RAM. With a columnar DB you can scan just the values of the columns you need.
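A back-of-the-envelope sketch of why this matters. The column count and the equal-width assumption are made up for illustration; the thread does not give the real schema, and compression is ignored.

```python
def column_scan_bytes(total_bytes: float, n_columns: int, columns_read: int) -> float:
    """Bytes touched when scanning only some columns of a columnar table.

    Assumes every column takes an equal share of the total, which real
    tables rarely do; this is only an order-of-magnitude estimate.
    """
    return total_bytes * columns_read / n_columns

TOTAL_BYTES = 3e12   # 3 TB, decimal units
N_COLUMNS = 30       # hypothetical schema width, not from the thread

scanned = column_scan_bytes(TOTAL_BYTES, N_COLUMNS, columns_read=1)
rate_60s = scanned / 60 / 1e9  # GB/s needed for a 60-second query
rate_40s = scanned / 40 / 1e9  # GB/s needed for a 40-second query

print(f"scanned: {scanned / 1e9:.0f} GB, needs {rate_60s:.2f}-{rate_40s:.2f} GB/s")
```

Under those assumptions a single-column aggregation touches about 100 GB, which is within reach of a few NVMe drives rather than needing tens of GB/s.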
riku_iki
722 days ago
Then the 3 TB total data volume is irrelevant. What's relevant is the size of the column.
dleeftink
722 days ago
How about many-to-many comparisons? I'm interested in this use case for fuzzy matching of many data records using a local DB.