Hacker News new | ask | show | jobs
by mootothemax 3 hours ago
If you haven’t investigated storing in parquet format - and it doesn’t break other consumers that need your jsonl formatted files - it could be worth trialling for your use case. You’ll see vastly smaller file sizes (even more so if you use zstd compression), and querying time will shoot up.

Usual caveats apply, but as a general rule it’s held up well for me. Only downside is that inspecting the results moves from vi on the output file to duckdb and a select * from.

1 comments

I'll 100% try DuckDB in more serious projects where I would normally use Sqlite.