hands-on lab

Moving From Spreadsheet to Database

Beginner
1h 20m
834
3.8/5
Get guided in a real environmentPractice with a step-by-step scenario in a real, provisioned environment.
Learn and validateUse validations to check your solutions every step of the way.
See resultsTrack your knowledge and monitor your progress.
Lab description

Databases are essential to organizing and understanding your data. Databases allow you to ask questions, or query your data, to find exactly what is needed. This lab is aimed at beginners who want to move beyond spreadsheets and migrate their data into a database. After completing this lab students will be able to create a basic MySQL Database in Amazon RDS, and populate it from a CSV. Additionally, students will learn how to use basic queries against their data.

Learning Objectives

  • Learn and Understand Amazon RDS Concepts
  • Utilizing Python to interact with a MySQL Database in Amazon RDS to ingest CSV Data
  • Learn how to query our newly populated database.

Intended Audience

This lab is intended for:

  • Individuals starting out with data engineering
  • Anyone interested in learning how to use a database

Prerequisites

You should possess:

  • A basic understanding of Python

To fulfill this prerequisite you can complete the Python for Beginners learning path.

Updates

October 20th, 2022 - Updated notebook to resolve database creation issue

About the author
Avatar
Calculated Systems
Training Provider
Students
31,640
Labs
31
Courses
13
Learning paths
42

Calculated Systems was founded by experts in Hadoop, Google Cloud and AWS. Calculated Systems enables code-free capture, mapping and transformation of data in the cloud based on Apache NiFi, an open source project originally developed within the NSA. Calculated Systems accelerates time to market for new innovations while maintaining data integrity.  With cloud automation tools, deep industry expertise, and experience productionalizing workloads development cycles are cut down to a fraction of their normal time. The ability to quickly develop large scale data ingestion and processing  decreases the risk companies face in long development cycles. Calculated Systems is one of the industry leaders in Big Data transformation and education of these complex technologies.

Covered topics
Lab steps
Logging In to the Amazon Web Services Console
Opening the Lab's Jupyter Notebook