by zX41ZdbW 1149 days ago
I've added a benchmark of ClickHouse and Athena to ClickBench:

https://pastila.nl/?0198061e/f2e0e7b2d61d0fe322607b58fc7200b...

Here ClickHouse operates in a "data lake" mode, simply processing a bunch of Parquet files on S3. Obviously, it is faster than Athena. But I also want to add Presto, Trino, Spark, Databricks, Redshift Spectrum, and Boilingdata, which are currently missing from the benchmark.
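
For anyone unfamiliar with what "data lake" mode looks like in practice, here is a minimal sketch of the kind of query involved, built in Python. It assumes the standard ClickHouse `s3` table function with a URL glob and the `Parquet` format name; the bucket URL and the `build_lake_query` helper are hypothetical and are not taken from the benchmark itself.

```python
# Hypothetical bucket URL (an assumption); point it at real Parquet files.
PARQUET_GLOB = "https://example-bucket.s3.amazonaws.com/hits/hits_*.parquet"


def build_lake_query(url_glob: str) -> str:
    """Return a ClickHouse query that counts rows straight from Parquet files on S3."""
    # Escape backslashes and single quotes so the URL stays a valid SQL string literal.
    safe = url_glob.replace("\\", "\\\\").replace("'", "\\'")
    # s3(url, format) reads the matching files in place; no table is loaded first.
    return f"SELECT count() FROM s3('{safe}', 'Parquet')"


if __name__ == "__main__":
    print(build_lake_query(PARQUET_GLOB))
```

The printed string can be passed to clickhouse-local or clickhouse-client with `--query`. The actual benchmark queries are in the ClickBench repository.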

Please help me add them: https://github.com/ClickHouse/ClickBench

Also, it includes another mode of ClickHouse, named "web": MergeTree tables hosted on an HTTP server (which is more efficient than Parquet). See https://github.com/ClickHouse/web-tables-demo

About R2 - it is currently slow, and also incompatible with S3 (e.g., no multipart uploads).