by sgarland
318 days ago
I read this [0] (I also recommend reading part 1 for background) a few weeks ago, and found it quite interesting. The entire concept of data lakes seems odd to me, as a DBRE. If you want performant OLAP, then get an OLAP DB. If you want temporality, have a created_at column and filter. If the problem is that you need to ingest petabytes of data, fix your source: your OLTP schema probably sucks and is causing massive storage amplification.

[0]: https://database-doctor.com/posts/iceberg-is-wrong-2.html