Google Data Engineer Exam – Professional Certification Preparation

Avg. Duration: 19h


This learning path is designed to help you prepare for the Google Certified Professional Data Engineer Exam. Even if you don't plan to take the exam, these courses will help you gain a solid understanding of the various data processing components of the Google Cloud Platform.

At the heart of Google’s big data services is BigQuery, a managed data warehouse in the cloud. The first three courses will show you how to load and query data in BigQuery, optimize BigQuery’s performance, and visualize your data.

The next three courses will show you how to process your data. First, you will use Cloud Machine Learning Engine to train neural networks to perform predictive analytics. Next, you’ll use Cloud Dataflow and Cloud Dataproc to build data processing pipelines that transform and summarize your data using Apache Beam, Hadoop, and Spark.

The final course will introduce you to Bigtable, Google’s revolutionary NoSQL database. It will show you how to take advantage of Bigtable’s high performance for big data applications.

All of these courses include hands-on demos you can do yourself. Then you can test what you’ve learned by taking the practice exam.

Learning Objectives

  • Design a data processing system
  • Build and maintain data structures and databases
  • Analyze data and enable machine learning
  • Optimize data representations, data infrastructure performance, and cost
  • Ensure reliability of data processing infrastructure
  • Visualize data
  • Design secure data processing systems


Prerequisites

  • Basic database knowledge

Intended Audience

  • Data professionals
  • People studying for the Google Professional Data Engineer exam


If you have thoughts or suggestions for this learning path, please contact Cloud Academy at


Training Content

Course - Beginner - 46m
Overview of Google Cloud Platform
In this course, you'll learn about GCP services such as compute, storage, and networking, and how to create virtual machines and web apps using the Google Cloud Console and gcloud CLI.
Exam - 20m
Knowledge Check: Overview of Google Cloud Platform
Course - Intermediate - 16m
Introduction to Google Cloud Data Loss Prevention
This course is a short introduction to Google Cloud Data Loss Prevention, which is a service that finds and de-identifies sensitive data, such as birthdates and credit card numbers.
Hands-on Lab - Intermediate - 45m
Inspecting and De-Identifying Data With Google Cloud Data Loss Prevention
In this lab, you will inspect data stored in Cloud Storage and identify the sensitive information it contains.
Course - Intermediate - 1h 49m
Google Cloud Platform: Systems Operations
This course covers Google Cloud systems operations, providing insight and practical information across the complete set of GCP features.
Course - Intermediate - 1h 8m
Designing a Google Cloud Infrastructure
This course uses a case study to show how to apply the design principles of security, compliance, and disaster recovery to meet real-world requirements.
Hands-on Lab - Intermediate - 45m
Granting Access to Google Cloud Storage Objects with Signed URLs
In this lab, you will use the gcloud CLI in Google Cloud Shell to create signed URLs that grant anyone access to objects stored in Google Cloud Storage for a set duration.
Course - Advanced - 1h 13m
Managing Your Google Cloud Infrastructure
This hands-on tutorial teaches you how to monitor, test, manage, and troubleshoot your GCP app infrastructure.
Hands-on Lab - Intermediate - 45m
Run SQL Queries and Analyze the DB with Google Cloud SQL
In this lab, you will create two tables in a PostgreSQL database, perform operations on them, monitor resource usage, and test that the database respects the atomicity property.
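As a rough illustration of what an atomicity test looks like, the sketch below runs a two-step transfer that fails halfway and checks that neither step was persisted. It is not the lab's procedure: it assumes Python's built-in sqlite3 module in place of Cloud SQL for PostgreSQL, uses a single table for brevity, and the table name, account names, and `transfer` helper are hypothetical.

```python
import sqlite3

# In-memory SQLite database standing in for a Cloud SQL PostgreSQL instance (assumption).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE accounts ("
    "name TEXT PRIMARY KEY, "
    "balance INTEGER NOT NULL CHECK (balance >= 0))"
)
conn.executemany("INSERT INTO accounts VALUES (?, ?)", [("alice", 100), ("bob", 50)])
conn.commit()


def transfer(conn, src, dst, amount):
    """Hypothetical helper: move `amount` from src to dst as one transaction."""
    try:
        # The connection context manager commits on success and
        # rolls back if any statement inside raises an exception.
        with conn:
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE name = ?",
                (amount, dst),
            )
            # This debit violates the CHECK constraint when funds are insufficient.
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE name = ?",
                (amount, src),
            )
        return True
    except sqlite3.IntegrityError:
        return False


# Alice only has 100, so the debit fails after the credit has already run.
ok = transfer(conn, "alice", "bob", 500)
balances = dict(conn.execute("SELECT name, balance FROM accounts"))
print(ok, balances)
```

If the database is atomic, the credit to the second account is rolled back along with the failed debit, so both balances are unchanged after the failed transfer.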
Course - Beginner - 36m
Introduction to Google BigQuery
Learn how to load data into BigQuery, run queries using standard SQL, and export data from BigQuery with this hands-on course.
Hands-on Lab - Beginner - 35m
Structure and Analyze Data with Google BigQuery
This lab introduces the basic concepts of BigQuery and lets you work with and query data in a real GCP environment.
Course - Intermediate - 37m
Optimizing Google BigQuery
Learn how to make BigQuery faster, cheaper, and more secure with this hands-on course.
Course - Intermediate - 40m
Visualizing BigQuery Data with Google Data Studio
In this course, you'll learn how to visualize BigQuery data with Google Data Studio and create BigQuery reports.
Course - Intermediate - 1h 3m
Introduction to Google AI Platform
In this course, you'll learn how to train and deploy neural networks with Google AI Platform.
Course - Advanced - 38m
Building Convolutional Neural Networks on Google Cloud
Learn how to build a CNN, train it on Machine Learning Engine and visualize its performance. Learn how to recognize overfitting and apply different methods to avoid it.
Hands-on Lab - Intermediate - 45m
Analyze and Retrieve Information from Text Using Google Cloud Natural Language
Course - Intermediate - 1h 9m
Introduction to Google Cloud Dataflow
In this course, you'll learn how to write data processing programs using Apache Beam and run them using Cloud Dataflow, including how to run both batch and streaming jobs.
Hands-on Lab - Intermediate - 1h
Deploy a Message Queuing Solution With Google Cloud Pub/Sub
Google Cloud Pub/Sub is a message queuing service that allows you to deploy topics and attach subscriptions to them. Once a message is published to a topic, Pub/Sub delivers it to all of the topic's attached subscriptions.
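The fan-out behavior described above can be pictured with a toy in-memory model. This is only a sketch of the concept and does not use the google-cloud-pubsub client library; the `Topic` class, its `attach` and `publish` methods, and the topic and subscription names are all hypothetical.

```python
from collections import deque


class Topic:
    """Toy in-memory topic (hypothetical; not the Pub/Sub API)."""

    def __init__(self, name):
        self.name = name
        self.subscriptions = {}  # subscription name -> queue of pending messages

    def attach(self, subscription_name):
        """Attach a subscription and return its message queue."""
        queue = deque()
        self.subscriptions[subscription_name] = queue
        return queue

    def publish(self, message):
        """Deliver one copy of the message to every attached subscription."""
        for queue in self.subscriptions.values():
            queue.append(message)


topic = Topic("orders")
billing = topic.attach("billing")
shipping = topic.attach("shipping")

topic.publish("order-1001")

# Each subscription received its own copy of the message.
print(list(billing), list(shipping))
```

In this model, a subscription attached after a message was published does not receive that earlier message; only messages published while it is attached reach its queue.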
Course - Intermediate - 49m
Introduction to Google Cloud Dataproc
In this course, you'll learn how to run Hadoop and Spark jobs on GCP.
Course - Intermediate - 48m
Introduction to Google Cloud Bigtable
In this course, you'll learn which of your applications could make use of Bigtable and how to take advantage of its high performance.
Hands-on Lab - Beginner - 45m
Managing a Redis Instance Using Google Cloud Memorystore
In this lab, you will create and manage a Redis instance by using Google Cloud Memorystore.
Course - Intermediate - 1m
Additional Topics for Google Data Engineer
This short video lists some of the other resources you should review before taking the Google Certified Professional Data Engineer exam.
Resource - Beginner - 4h
Required Reading for Google Data Engineer Exam
Exam - 2h
Cert Prep: GCP Data Engineer
About the Author
Learning paths: 87

Guy launched his first training website in 1995 and he's been helping people learn IT technologies ever since. He has been a sysadmin, instructor, sales engineer, IT manager, and entrepreneur. In his most recent venture, he founded and led a cloud-based training infrastructure company that provided virtual labs for some of the largest software vendors in the world. Guy’s passion is making complex technology easy to understand. His activities outside of work have included riding an elephant and skydiving (although not at the same time).