Hacker News new | ask | show | jobs
by benjaminwootton 1261 days ago
A column oriented database is probably the bigger performance increase. Parquet and a good data warehouse (something like Clickhouse, Druid or Snowflake) will both use metadata and efficient scans to power through aggregation queries.