hands-on lab

Efficiently Storing Data in S3 for Data Analytics Solutions

Beginner

Up to 1h 15m

2,066

4.6/5

Start lab

Get guided in a real environmentPractice with a step-by-step scenario in a real, provisioned environment.

Learn and validateUse validations to check your solutions every step of the way.

See resultsTrack your knowledge and monitor your progress.

Description

Amazon S3 is a fully managed service for storing data in the cloud. Amazon S3 frees you from managing servers, NAS/SAN devices, and from having to worry about individual physical disks.

Amazon S3 is very flexible and because of that, it is used in a large number of different types of solutions. When building a solution in AWS and you need storage, S3 is likely the best option when considering cost and performance. If you are using S3 with AWS Data Analytics services, there are a number of things you should be aware of to minimize costs and maximize the performance of your Data Analytics solution.

In this lab, you will create data, store it in S3, and transform the data to be more performant and cost-efficient.

Learning Objectives

Upon completion of this beginner level lab you will be able to:

Use the AWS command-line tool to copy data from an EC2 instance to S3
Partition data files in S3
Compress data to reduce costs
Convert data into different formats to reduce costs and maximize performance

Intended Audience

Candidates for the AWS Certified Data Analytics Speciality exam
Data Engineers
Cloud Engineers

Prerequisites

Experience with S3 and the Linux command-line will be beneficial but is not required.

The following courses can be used to fulfill the prerequisites:

Updates

March 12th, 2023 - Resolved an issue that caused the lab to fail to set up on rare occasions

January 11th, 2023 - Updated screenshots & instructions to reflect latest UI

October 26th, 2022 - Resolved an issue preventing lab setup from completing

January 28th, 2022 - Updated the instructions and screenshots to reflect the latest UI

June 24th, 2021 - Improved instructions

Environment before

Environment after

About the author

Andrew Burchill, opens in a new tab

Labs Developer

Students

68,469

Labs

170

Courses

Learning paths

Andrew is a Labs Developer with previous experience in the Internet Service Provider, Audio Streaming, and CryptoCurrency industries. He has also been a DevOps Engineer and enjoys working with CI/CD and Kubernetes.

He holds multiple AWS certifications including Solutions Architect Associate and Professional.