|
|
|
|
|
by sspaeti
1393 days ago
|
|
I created a small guide with the most important topics on a Data Lake and Lakehouse containing the following chapters. - What is a Data Lake & Why do you need one?
- Differences between Lakehouse & Warehouse
- Components of Data Lake
- Storage Layer (AWS S3, Azure Blob Storage, Google Cloud Storage)
- File Format (Apache Parquet, Avro, ORC)
- Table Format (Delta Lake, Apache Hudi, and Iceberg)
- Trends in the Market
- How to turn it into a Lakehouse I hope that's interesting to one or the other. Curious to hear your thoughts and opinions. What's your Data Lake Table Format of choice, and why? |
|