Getting Started with Amazon Redshift

Beginner

167 students completed the lab in ~54m

Total available time: 1h:30m

Lab Overview

Amazon Redshift is a managed data warehouse that lets you analyze your data using standard SQL and your existing Business Intelligence (BI) tools. Redshift combines query optimization, columnar storage, parallel execution, and high-performance disks to run fast queries on datasets that can scale to petabytes. In this lab, you will learn how to create, load data into, query, and resize a Redshift cluster.

Lab Objectives

Upon completion of this lab, you will be able to:

  • Log in to the AWS Management Console

  • Connect to an EC2 instance to communicate with Redshift

  • Create and resize a Redshift cluster

  • Load data into Redshift

  • Query data within Redshift

Lab Prerequisites

You should be familiar with:

  • Basic use of your local operating system and computer

  • Secure terminal connection software such as Terminal (macOS) or PuTTY (Windows)

Lab Environment

[Figure: the lab environment before completing the lab instructions]

[Figure: the lab environment after completing the lab instructions]

Follow these steps to learn by building helpful cloud resources

Log In to the Amazon Web Services Console

Your first step to start the lab experience

Creating the Redshift Cluster

Create your first cluster

Connecting to the Virtual Machine using SSH

Create a secure connection to a remote machine

Retrieving the Redshift IAM role

Record the Redshift IAM role

Loading data into Redshift

Create tables and copy data

Running sample queries

Use SQL to retrieve specific data

Resizing the cluster

Add another node to your Redshift cluster

Cleaning up the environment

Delete the lab resources
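
To give a rough idea of what the "Loading data into Redshift" step involves, here is a minimal Python sketch that assembles a Redshift COPY statement from an S3 location and the IAM role recorded in the earlier step. It is an illustration only, not the lab's actual commands: the helper build_copy_statement, the table name, the bucket, and the role ARN are all hypothetical placeholders, and the sketch assumes pipe-delimited files.

```python
def build_copy_statement(table: str, s3_uri: str, iam_role_arn: str,
                         delimiter: str = "|") -> str:
    """Return a Redshift COPY statement that loads delimited files from S3.

    Hypothetical helper for illustration; the table name is inserted as-is,
    so it should come from a trusted source.
    """
    def quote(value: str) -> str:
        # SQL string literals escape a single quote by doubling it.
        return "'" + value.replace("'", "''") + "'"

    return (
        f"COPY {table} "
        f"FROM {quote(s3_uri)} "
        f"IAM_ROLE {quote(iam_role_arn)} "
        f"DELIMITER {quote(delimiter)};"
    )


# Placeholder values (assumptions, not taken from the lab).
statement = build_copy_statement(
    table="users",
    s3_uri="s3://example-bucket/users/",
    iam_role_arn="arn:aws:iam::123456789012:role/ExampleRedshiftRole",
)
print(statement)
```

The resulting statement could then be run from a SQL client connected to the cluster. The IAM role is what allows Redshift to read the files from S3 on your behalf, which is why the lab has you record it before loading data.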