by gdubya
523 days ago
Hilbert curves are used in modern data lakehouse storage optimisation techniques, such as Databricks' "liquid clustering" [1]. This can replace the more traditional "hive-style" partitioning, in which data files are laid out in a folder structure based on the partition values (e.g. `mydatafiles/YYYY/MM/DD/`).

[1] https://docs.databricks.com/en/delta/clustering.html
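
To illustrate the general idea (a rough sketch only, not Databricks' actual implementation): a Hilbert curve maps a point in a 2D grid to a single index, so that points with nearby indexes are also near each other in the grid. Sorting rows by that index before writing files tends to keep similar values of both columns together in the same file. The function below is the textbook `xy2d` conversion; the name `hilbert_index` and the toy 8x8 grid are my own assumptions for the example.

```python
def hilbert_index(n, x, y):
    """Map cell (x, y) of an n-by-n grid (n a power of two) to its
    position along the Hilbert curve, using the textbook xy2d method."""
    d = 0
    s = n // 2
    while s > 0:
        rx = 1 if (x & s) else 0
        ry = 1 if (y & s) else 0
        d += s * s * ((3 * rx) ^ ry)
        # Rotate/flip the quadrant so the sub-curve is oriented correctly.
        if ry == 0:
            if rx == 1:
                x = n - 1 - x
                y = n - 1 - y
            x, y = y, x
        s //= 2
    return d


# Hypothetical rows with two clustering columns already bucketed to 0..7.
rows = [(x, y) for x in range(8) for y in range(8)]

# Sorting by the Hilbert index gives a write order in which consecutive
# rows are always neighbouring cells of the grid.
clustered = sorted(rows, key=lambda r: hilbert_index(8, r[0], r[1]))
```

With hive-style partitioning you would pick one column hierarchy up front; here both columns contribute to the sort key, so range filters on either one can skip files.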