Hacker News new | ask | show | jobs
by sgarland 318 days ago
I read this [0] (I also recommend reading part 1 for background) a few weeks ago, and found it quite interesting.

The entire concept of data lakes seems odd to me, as a DBRE. If you want performant OLAP, then get an OLAP DB. If you want temporality, have a created_at column and filter. If the problem is that you need to ingest petabytes of data, fix your source: your OLTP schema probably sucks and is causing massive storage amplification.

[0]: https://database-doctor.com/posts/iceberg-is-wrong-2.html