|
|
|
|
|
by sspaeti
1277 days ago
|
|
> The column-oriented storage format used by data warehouses allows them to efficiently leverage modern SIMD computer architectures for columnar-vectorized processing. I find it interesting how these vectorized processing engines with DuckDB and Photon Engine of Databricks try to combine row and columnar-oriented strengths. |
|
> When to use a data warehouse > Data warehouses are good for OLAP (online analytical processing) workloads such as the following:
> A small number of users, each of which may execute heavy analytics workloads
Not necessarily. ClickHouse is being used for user-facing analytics where every query can take in the order of 10 ms while supporting many concurrent queries.
> Downtime is permitted – generally not used as the one-and-only operational system
Not necessarily. ClickHouse is being used for HA setups replicated across multiple regions when the service has to survive the outage of a whole region.