by cube2222
960 days ago
It would be interesting to see more benchmarks of, e.g., querying multiple files from S3, and how that has evolved across versions. When I checked with 0.7.1, working with ~90 S3 Parquet objects (x0000 rows each, so not too many), it was 25-50% faster to first download them in Go and then query them than to use the DuckDB S3 extension on those objects directly (the whole execution ran on the order of a couple hundred milliseconds).
It’s not a DuckDB-specific issue (although there’s headroom for improvement; I don’t think DuckDB’s S3 connector is highly optimized). It’s S3.