Hacker News new | ask | show | jobs
by KptMarchewa 1178 days ago
> Yet, everyone misses reproducibility and data versioning :)

Delta Lake/Apache Iceberg solves that.

2 comments

Absolutely not.

A single vendor/tech does not "solve" anything when the task at hand implies you need to entirely re-design data pipelines, ML modelling and benchmarking.

LMAO no, no it doesn't and has major migration consequences for existing data warehouses.

Reproducibility is more than just upstream data versioning.