Understanding ETL Services on AWS

3m 14s

This lesson introduces the Understanding ETL Services on AWS course, which explains two key extract, transform, and load (ETL) services on AWS: AWS Glue and Amazon EMR. In particular, the course covers the following:

  • Serverless ETL
  • The architecture of a typical ETL project
  • The prerequisite setup of the AWS components needed to use AWS Glue for ETL
  • How to use AWS Glue to perform serverless ETL
  • How to edit ETL processes created in AWS Glue
  • A foundational understanding of Amazon EMR
  • Core Amazon EMR characteristics
  • The EMR base architecture
  • How AWS Glue compares to Amazon EMR
  • How to make ETL processes more automated and repeatable using orchestration services such as AWS Data Pipeline, AWS Glue Workflows, and AWS Step Functions
  • The AWS Data Wrangler library and how it provides an abstraction for extract and load operations on AWS services
About the Author
Alana Layton
Sr. AWS Content Creator

Alana Layton is an experienced technical trainer, technical content developer, and cloud engineer based in Seattle, Washington. Her career has included teaching AWS all over the world, creating AWS content that is fun, and working in consulting. She currently holds six AWS certifications. Outside of Cloud Academy, you can find her testing her knowledge at bar trivia, reading, or training for a marathon.

Covered Topics