|
|
|
|
|
by Jarwain
380 days ago
|
|
That's fair! I guess I see it as trading technical complexity with the human complexity of getting everyone on board with an update to the standard, and getting that standard implemented across the board. It's a lot easier to get my coworkers to just use duckdb as a reader/writer with ducklake than to change the system. Frankly, I'm not entirely sure what the process of proposing that change to the hive file scheme would even look like |
|
Maybe convince DuckDB and/or clickhouse-local and/or polars.scan_parquet to implement it as a pilot? If it's a success, other tools might follow suit.
Or maybe something like DuckLake could have an option to put column statistics in the filenames. I raised this as a discussion:
https://github.com/duckdb/ducklake/discussions/92