hands-on lab

Capturing New Knowledge from Your Data

Intermediate
1h 15m
467
4.3/5
Get guided in a real environmentPractice with a step-by-step scenario in a real, provisioned environment.
Learn and validateUse validations to check your solutions every step of the way.
See resultsTrack your knowledge and monitor your progress.
Lab description

Databases are extraordinarily powerful in managing sets of data. This lab is aimed at students with a moderate understanding of data engineering, and Python who want to understand how to perform aggregates on data such as COUNT(), SUM(), and AVG(). We will also look into advanced statements like CASE, and functions like DATEDIFF() and CONCAT(). We will walk through the changing requirements of a bug-tracking application and how to handle them.

Learning Objectives

Upon completion of this lab you will be able to:

  • Utilize SQL aggregate functions
  • Utilize complex functions
  • Learn how to get complex insights from your data

Intended Audience

This lab is intended for:

  • Data engineers
  • Anyone interested in gaining insights from data using SQL

Prerequisites

You should possess:

  • A moderate understanding of Python
  • A basic understanding of data engineering concepts

Lab Environment

Due to the resources being provisioned for this lab, it can take up to 20 minutes from when you start the lab until the SQL instance becomes ready.

 

Updates

September 4th, 2023 - Updated instruction to use the correct table

 

About the author
Avatar
Calculated Systems
Training Provider
Students
31,665
Labs
31
Courses
13
Learning paths
42

Calculated Systems was founded by experts in Hadoop, Google Cloud and AWS. Calculated Systems enables code-free capture, mapping and transformation of data in the cloud based on Apache NiFi, an open source project originally developed within the NSA. Calculated Systems accelerates time to market for new innovations while maintaining data integrity.  With cloud automation tools, deep industry expertise, and experience productionalizing workloads development cycles are cut down to a fraction of their normal time. The ability to quickly develop large scale data ingestion and processing  decreases the risk companies face in long development cycles. Calculated Systems is one of the industry leaders in Big Data transformation and education of these complex technologies.

Covered topics
Lab steps
Signing In to the Google Cloud Console
Opening the Lab's Jupyter Notebook in Google Cloud