Hacker News
by bigdubs 1226 days ago
We use DuckDB extensively where I work (https://watershed.com). The primary way we're using it is to query Parquet-formatted files stored in GCS, and we have some machinery to make that doable on demand for "online" reporting and analysis queries.
2 comments

Storing data in Parquet files and querying via DuckDB is fast and kind of magical.
Shockingly fast, and having the intermediate files be immutable is super nice.
Haven't used it yet, but this aspect seems very appealing.
Do you load the Parquet files into DuckDB or just query them directly?
We query them directly in most cases with the httpfs extension, but for hot paths we fetch them and cache them on disk locally.
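For anyone wondering what that split might look like, here is a rough sketch, not the actual setup described above. It assumes plain HTTPS URLs and the `duckdb` Python package; `CACHE_DIR`, `resolve_source`, `download`, and `count_rows` are made-up names for illustration. Because the files are immutable, the cache never needs invalidation: a file that is already on disk is still correct.

```python
import hashlib
import urllib.request
from pathlib import Path

CACHE_DIR = Path("/tmp/parquet_cache")  # hypothetical cache location


def local_cache_path(url: str, cache_dir: Path = CACHE_DIR) -> Path:
    """Map a remote URL to a stable local file name."""
    digest = hashlib.sha256(url.encode("utf-8")).hexdigest()[:16]
    return cache_dir / f"{digest}.parquet"


def download(url: str, dest: Path) -> None:
    """Default fetcher: HTTP(S) GET into a temp file, then rename into place."""
    tmp = dest.with_suffix(".tmp")
    with urllib.request.urlopen(url) as resp, open(tmp, "wb") as out:
        out.write(resp.read())
    tmp.replace(dest)


def resolve_source(url: str, hot: bool = False,
                   cache_dir: Path = CACHE_DIR, fetch=download) -> str:
    """Return what DuckDB should read: the URL itself, or a cached local copy."""
    if not hot:
        return url  # cold path: let httpfs read the remote file directly
    path = local_cache_path(url, cache_dir)
    if not path.exists():
        # Immutable files: fetch once, reuse forever.
        cache_dir.mkdir(parents=True, exist_ok=True)
        fetch(url, path)
    return str(path)


def count_rows(source: str) -> int:
    """Run a query against a local path or a remote URL."""
    import duckdb  # third-party: pip install duckdb

    con = duckdb.connect()
    if source.startswith(("http://", "https://")):
        con.execute("INSTALL httpfs")
        con.execute("LOAD httpfs")
    quoted = source.replace("'", "''")  # escape single quotes for the SQL literal
    return con.execute(
        f"SELECT count(*) FROM read_parquet('{quoted}')"
    ).fetchone()[0]
```

Usage would be something like `count_rows(resolve_source(url, hot=True))` for a report that runs often, and `count_rows(resolve_source(url))` for a one-off. Reading from GCS itself would need credentials and the appropriate URL scheme configured, which this sketch leaves out.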