Introduction to Delta Lake on Azure Databricks

Intermediate
1m
2,472
5/5

Delta Lake is an open-source storage layer that’s included in Azure Databricks. It supports structured and unstructured data, ACID transactions, and batch and stream processing. This lesson provides an overview of Delta Lake, including some history of earlier data solutions and why you might choose Delta Lake instead. You'll learn how to use and optimize Delta Lake for your own workloads.

Learning Objectives

  • Understand what Delta Lake is and what it's used for
  • Learn how to optimize Delta Lake

Intended Audience

This lesson is intended for anyone who wants to learn how to use Delta Lake on Azure Databricks.

Prerequisites

To get the most from this lesson, you should already have some knowledge of Apache Spark and Azure Databricks. If you’re not familiar with those, then you should take our Running Spark on Azure Databricks lesson. It would also be helpful to have some experience with SQL.

About the Author
Avatar
Guy Hummel, opens in a new tab
Azure and Google Cloud Content Lead
Students
235,267
Courses
103
Learning paths
168

Guy launched his first training website in 1995 and he's been helping people learn IT technologies ever since. He has been a sysadmin, instructor, sales engineer, IT manager, and entrepreneur. In his most recent venture, he founded and led a cloud-based training infrastructure company that provided virtual labs for some of the largest software vendors in the world. Guy’s passion is making complex technology easy to understand. His activities outside of work have included riding an elephant and skydiving (although not at the same time).

Covered Topics