by thingsilearned
2429 days ago
We're definitely not trying to start from scratch or throw out all the old knowledge and practices, just update them for the data stacks commonly used today. In the book we keep many of the old terms and recommendations. Most of the high-level organizing is still totally right, but a lot of the optimization work done for performance and cost reasons is very different now. For example, ELT now makes much more sense than ETL, for the reasons Kostas wrote about here: https://dataschool.com/data-governance/etl-vs-elt/ And many things previously done for cost and performance reasons are just not relevant anymore thanks to the big innovations in C-Store warehouses.
|
The difference now is that you have Hadoop and cloud providers that will take a credit card and give you as much space as you can pay for. The concept is not new; cost was just a factor back then because capacity was fixed and memory was expensive.
The only thing that has changed is that the commoditization of hardware has allowed for behaviors that would previously have been cost-prohibitive.