Using Azure Databricks to Import and Analyze Data

Lab Steps

lock
Logging into the Microsoft Azure Portal
lock
Adding Customer Data to Azure Data Lake Store
lock
Creating an Azure Databricks Workspace
lock
Creating a Spark Cluster and Python Notebook in Azure Databricks
lock
Importing Azure Data Lake Storage Data into Databricks
lock
Interacting with Data in Azure Databricks

The hands-on lab is part of these learning paths

DP-201 Exam Preparation: Designing an Azure Data Solution
course-steps
11
certification
1
lab-steps
5
Big Data Analytics on Azure
course-steps
3
certification
1
lab-steps
2

Ready for the real environment experience?

DifficultyBeginner
Time Limit1h
Students405
Ratings
4.6/5
starstarstarstarstar-half

Description

Azure Databricks is an analytics platform powered by Apache Spark. Spark is a unified analytics engine capable of working with virtually every major database, data caching service, and data warehouse provider. In addition to it working with most providers, companies use Spark because it uses in-memory computing among other optimizations to offer very fast analytics. Azure Databricks enables companies to integrate their data analytics solutions into their existing Azure infrastructure. In this lab, you'll load data into Azure Data Lake Store and use Databricks to interact with that data through a Databricks workspace and cluster that you'll configure.

Learning Objectives

Upon completion of this lab you will be able to:

  • Load data into Azure Data Lake Store
  • Create and manage a Databricks workspace
  • Create and manage a Databricks cluster
  • Mount data into a Databricks workspace from Azure Data Lake Store
  • Interact with data using Databricks

Intended Audience

This lab is intended for:

  • Azure administrators
  • Cloud engineers and solutions architects
  • Data engineers
  • Anyone with a need to visualize and analyze data in Azure

Prerequisites

You should be familiar with:

Updates

June 22nd, 2020 - Clarified the format of the Azure Data Lake Storage URL and included a screenshot to avoid confusion 

Environment before
PREVIEW
arrow_forward
Environment after
PREVIEW
About the Author
Students32667
Labs42
Learning paths1

Matt has worked for multiple Fortune 500 companies as a DevOps Engineer and Solutions Architect. He is an AWS Certified DevOps Engineer - Professional, and an AWS Certified Solution Architect - Associate. He enjoys reading and learning new technologies.