by mbrt
578 days ago
It's my understanding that the newer generation of data lakes still uses a tiny, strongly consistent metadata database to keep track of what is where. It is orders of magnitude smaller than what you'd have if you put everything in the same database, but it's still there. The same is true of newer data streaming platforms (e.g. https://www.warpstream.com/blog/kafka-is-dead-long-live-kafk...). I'm curious to hear if you have examples of any database using only object storage as a backend, because back when I started, I couldn't find any.
Take a look at Delta Lake
https://notes.eatonphil.com/2024-09-29-build-a-serverless-ac...
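
For a rough idea of how a table format can get by without a separate metadata database: as far as I understand it, Delta Lake keeps a log of numbered commit files next to the data and relies on an atomic "create only if the key does not exist yet" write to pick a winner among concurrent writers. Below is a minimal sketch of that idea, not Delta Lake's actual code. `InMemoryObjectStore`, `put_if_absent`, `list_keys`, `commit`, and the `_log/` key layout are all hypothetical names for illustration, and the in-memory dict stands in for a real object store that offers a conditional put.

```python
import json


class InMemoryObjectStore:
    """Hypothetical stand-in for an object store with a conditional put."""

    def __init__(self):
        self._objects = {}

    def put_if_absent(self, key, data):
        # Assumed to be atomic in a real store: succeeds only if the key is new.
        if key in self._objects:
            return False
        self._objects[key] = data
        return True

    def list_keys(self, prefix):
        return sorted(k for k in self._objects if k.startswith(prefix))


def commit(store, table, actions, max_retries=10):
    """Append one commit file to the table's log using optimistic concurrency."""
    prefix = f"{table}/_log/"
    for _ in range(max_retries):
        # The next version is the number of commits currently visible in the log.
        version = len(store.list_keys(prefix))
        key = f"{prefix}{version:020d}.json"
        # If another writer claimed this version first, re-read the log and retry.
        if store.put_if_absent(key, json.dumps(actions).encode()):
            return version
    raise RuntimeError("too many concurrent commits")
```

Readers would rebuild the table state by listing the log prefix and replaying the commit files in order, so the only coordination point is the conditional write itself.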