1. Home
  2. Training Library
  3. Amazon Web Services
  4. Courses
  5. Understanding Data Lakes in AWS

What is a Data Lake?

What is a Data Lake?
Overview
Difficulty
Intermediate
Duration
14m
Students
108
Ratings
5/5
starstarstarstarstar
Description

Many organizations have implemented data lakes to great success, giving them a tactical business edge through the use of data analysis and predictive analytics.

This course covers the basics of data lakes, how they are different from data warehouses, and the components that make up a successful data lake.

Learning Objectives

  • Understand the difference between data warehouses and data lakes
  • Know what qualities make up a good data lake.
  • Learn about AWS Lake Formation and how it can transform the process of creating a data lake from taking months to days

Intended Audience

This course is intended for anyone who is responsible for managing business data or for those interested in creating a data lake in general.

Prerequisites

To get the most out of this course, you should have a decent understanding of cloud computing and cloud architectures, specifically with Amazon Web Services.

Transcript

What is a data lake? A data lake is a place for your business or enterprise to store and collect data. The data you store in your data lake may be structured or unstructured, meaning it can have a defined schema or not. 

The goal of our data lake is to have a single place where all of our business information can exist, and eventually, we can have some type of analytics performed on it. This data can be from our transactional systems and line of business applications. It could also be from various IoT devices, mobile applications, and even social media.

Companies that are able to aggregate, and work on their data, and derive meaning from it will be able to outperform their peers. These companies might do so through the use of generic data analytics or even by using machine learning to provide valuable insights.

This is why it is important to manage and create a safe place for all your data to live, A.K.A. a data lake.

 

About the Author
Avatar
Will Meadows
Senior Content Developer
Students
4298
Courses
24

William Meadows is a passionately curious human currently living in the Bay Area in California. His career has included working with lasers, teaching teenagers how to code, and creating classes about cloud technology that are taught all over the world. His dedication to completing goals and helping others is what brings meaning to his life. In his free time, he enjoys reading Reddit, playing video games, and writing books.