image
AWS Data Pipeline vs. AWS Glue
Introduction
Difficulty
Intermediate
Duration
28m
Students
209
Ratings
5/5
Description

In this course, we will compare Amazon EMR and AWS Glue and cover ways to make ETL processes more automated and repeatable.

Learning Objectives

  • What AWS Glue is and how it works 
  • How AWS Glue compares to Amazon EMR 
  • How to make ETL processes more automated and repeatable using orchestration services such as AWS Data Pipeline, AWS Glue Workflows, and AWS Step Functions

Intended Audience

  • Those who are implementing and managing ETL on AWS

  • Those who are looking to take an AWS certification — specifically the AWS Certified Solutions Architect – Associate Certification or the AWS Certified Data Analytics - Specialty Certification

Prerequisites 

In this course, I will provide introductory information on AWS Glue. However, to get the most from this course, you should already have an understanding of Amazon EMR and Amazon EC2. For more information on these services, please see our existing content titled: 

Transcript

Hello and welcome to this course, which will talk about some of the core data processing services on AWS.

Working with data is often an ugly experience. Data needs cleaning, validation, transforming, and enriching. This is where ETL, or extract, transform, and load, processing comes into play. AWS offers a few ETL services that will properly categorize, clean, and enrich your data. Two of the main ETL services are Amazon EMR and AWS Glue. In this course, I’ll be comparing the two services, and discussing ways to make ETL processes more automated and repeatable.

Before we get any further - I’d like to introduce myself. My name is Alana Layton, and I am an AWS content creator here at Cloud Academy.  Feel free to connect with me to ask any questions using the details shown on the screen. Alternatively, you can always get in touch with us here at Cloud Academy by sending an e-mail to support@cloudacademy.com where one of our Cloud experts will reply to your question.

This course has been created for those who are implementing and managing ETL on AWS, and for those who are looking to take an AWS certification - specifically, the AWS Certified Solutions Architect – Associate Certification or the AWS Certified Data Analytics - Specialty Certification. 

By the end of this course, you will have a greater understanding of ETL processes on AWS, including:

  • What AWS Glue is and how it works 

  • How AWS Glue compares to Amazon EMR

  • How to make ETL processes more automated and repeatable using orchestration services such as AWS Data Pipeline, AWS Glue Workflows, and AWS Step Functions 

In this course, I will provide introductory information on AWS Glue. However, to get the most out of this course, you should already have an understanding of Amazon EMR and Amazon EC2. For more information on these services please see our existing content, titled Introduction to Amazon Elastic Map Reduce and Compute Fundamentals for AWS. Feedback on our courses here at Cloud Academy is valuable to both us as trainers and any students looking to take the same course in the future. If you have any feedback, positive or negative, it would be greatly appreciated if you could contact support@cloudacademy.com.

Please note that, at the time of writing this content, all course information was accurate.  AWS implements hundreds of updates every month as part of its ongoing drive to innovate and enhance its services.

As a result, minor discrepancies may appear in the course content over time.  Here at Cloud Academy, we strive to keep our content up to date in order to provide the best training available. So, if you notice any information that is outdated, please contact support@cloudacademy.com.  This will allow us to update the course during its next release cycle.

Thank you! 

About the Author
Students
2943
Courses
27
Learning Paths
5

Alana Layton is an experienced technical trainer, technical content developer, and cloud engineer living out of Seattle, Washington. Her career has included teaching about AWS all over the world, creating AWS content that is fun, and working in consulting. She currently holds six AWS certifications. Outside of Cloud Academy, you can find her testing her knowledge in bar trivia, reading, or training for a marathon.