Mastering Delta Lakes in Azure
You'll walk away with all the skills needed to get started on your Delta and workshops material & labs for future reference.
Once upon a time we had the Data Warehouse, life was good but it had its limitations, particularly around loading/storing complex data types. As data grew larger and more varied, the warehouse became too rigid and opinionated.
So we dove headfirst into Data Lakes to store our data. Again, things were good, but missed some of the good times that the Data Warehouse had given us. The lake had become too flexible, we needed stability in our life. In particular, we needed A.C.I.D (Atomicity, Consistency, Isolation, and Durability) Transactions.
Delta Lake, hosted by the Linux Foundation, is an open-source file layout protocol for giving us back those good times, whilst retaining all of the flexibility of the lake. Delta has gone from strength to strength, and in 2022 Databricks finally open-sourced the entire code-base, including lots of advanced features that were previously Databricks-only. This workshop takes you from the absolute basics of using Delta within a Lake, through to some of those advancing engineering features, letting you really master your Delta Lake.
In this workshop we will go from Zero to Hero with Delta, including:
- Handling Schema Drift
- Applying Constraints and Database Designs
- Time-Travel & Management
- Optimize & Performance Tuning
- Streaming
We will also show you how to work with Delta inside and outside of its original home of Databricks.
Wednesday 15 March, 2023 at 9:00 am (GMT)
Theme:
Big Data & Data Engineering
Session Level:
Beginner
Required Experience:
N/A
Technologies covered:
Data Lake & Databricks