Hacker News new | ask | show | jobs
by sspaeti 1277 days ago
> The column-oriented storage format used by data warehouses allows them to efficiently leverage modern SIMD computer architectures for columnar-vectorized processing.

I find it interesting how these vectorized processing engines with DuckDB and Photon Engine of Databricks try to combine row and columnar-oriented strengths.

1 comments

The author of the article underestimates the capabilities of the modern OLAP DBMS. The old beliefs are becoming less and less relevant.

> When to use a data warehouse > Data warehouses are good for OLAP (online analytical processing) workloads such as the following:

> A small number of users, each of which may execute heavy analytics workloads

Not necessarily. ClickHouse is being used for user-facing analytics where every query can take in the order of 10 ms while supporting many concurrent queries.

> Downtime is permitted – generally not used as the one-and-only operational system

Not necessarily. ClickHouse is being used for HA setups replicated across multiple regions when the service has to survive the outage of a whole region.