Google Data Engineer Exam – Professional Certification Preparation
Description
This learning path is designed to help you prepare for the Google Certified Professional Data Engineer Exam. Even if you don't plan to take the exam, these courses will help you gain a solid understanding of the various data processing components of the Google Cloud Platform.
At the heart of Google’s big data services is BigQuery, a managed data warehouse in the cloud. The first three courses will show you how to load and query data in BigQuery, optimize BigQuery’s performance, and visualize your data.
The next three courses will show you how to process your data. First, you will use Cloud Machine Learning Engine to train neural networks to perform predictive analytics. Next, you’ll use Cloud Dataflow and Cloud Dataproc to build data processing pipelines that transform and summarize your data using Apache Beam, Hadoop, and Spark.
The final course will introduce you to Bigtable, Google’s revolutionary NoSQL database. It will show you how to take advantage of Bigtable’s high performance for big data applications.
All of these courses include hands-on demos you can do yourself. Then you can test what you’ve learned by taking the practice exam.
Learning Objectives
- Design a data processing system
- Build and maintain data structures and databases
- Analyze data and enable machine learning
- Optimize data representations, data infrastructure performance, and cost
- Ensure reliability of data processing infrastructure
- Visualize data
- Design secure data processing systems
Prerequisites
- Basic database knowledge
Intended Audience
- Data professionals
- People studying for the Google Professional Data Engineer exam
Feedback
If you have thoughts or suggestions for this learning path, please contact Cloud Academy at support@cloudacademy.com.
Learning Path Steps
This course introduces you to the fundamentals of Google Cloud Platform, including App Engine, Kubernetes Engine, Compute Engine, storage, BigQuery, Cloud Firestore, and app deployment.
In this lab, you will inspect data stored in Cloud Storage and understand the sensitive information therein.
This course covers Google Cloud systems operations, providing insight and practical information across the complete set of GCP features.
This course uses a case study to show how to apply the design principles of security, compliance, and disaster recovery to meet real-world requirements.
Granting Access to Google Cloud Storage Objects with Signed URLs
In this lab, you will use the gcloud CLI in Google Cloud Shell to create signed URLs that grant anyone access to objects stored in Google Cloud Storage for a set duration.
This hands-on tutorial teaches you how to monitor, test, manage, and troubleshoot your GCP app infrastructure.
In this lab, you will create two tables in a SQL PostgreSQL database, perform operations on them, monitor resource usage, and test that the database respects the atomicity property.
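As a rough illustration of what testing atomicity means, the sketch below runs a two-step transfer inside a single transaction and checks that a failure in the second step undoes the first. It is only a sketch under stated assumptions: it uses Python's built-in SQLite module as a stand-in for the PostgreSQL database used in the lab, and the `accounts` table, the `transfer` function, and the sample balances are hypothetical.

```python
import sqlite3

# Assumption: an in-memory SQLite database stands in for PostgreSQL.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE accounts ("
    "name TEXT PRIMARY KEY, "
    "balance INTEGER CHECK (balance >= 0))"
)
conn.execute("INSERT INTO accounts VALUES ('alice', 100), ('bob', 50)")
conn.commit()


def transfer(conn, source, target, amount):
    """Move `amount` from source to target; return True if it committed."""
    try:
        # The connection context manager commits if the block succeeds
        # and rolls back every statement in it if an exception is raised.
        with conn:
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE name = ?",
                (amount, target),
            )
            # This debit violates the CHECK constraint when funds are short.
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE name = ?",
                (amount, source),
            )
        return True
    except sqlite3.IntegrityError:
        return False


def balances(conn):
    """Return the current balances as a dict of name -> balance."""
    return dict(conn.execute("SELECT name, balance FROM accounts"))


# The debit fails, so the earlier credit to bob must be rolled back too.
failed = transfer(conn, "alice", "bob", 500)
after_failure = balances(conn)

# A transfer within the available funds commits both updates together.
succeeded = transfer(conn, "alice", "bob", 30)
after_success = balances(conn)
```

The credit is deliberately applied before the debit so that a partial update would be visible if the transaction were not atomic.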
Learn how to load data into BigQuery, run queries using standard SQL, and export data from BigQuery with this hands-on course.
This lab will show you the basic concepts of BigQuery and let you handle data and query it in a real GCP environment.
Learn how to make BigQuery faster, cheaper, and more secure with this hands-on course.
With this course, you'll learn how to visualize BigQuery Data with Google Data Studio and create BigQuery reports.
In this course, you'll learn how to train and deploy neural networks with Google AI Platform.
Learn how to build a CNN, train it on Machine Learning Engine and visualize its performance. Learn how to recognize overfitting and apply different methods to avoid it.
In this course, you'll learn how to write data processing programs using Apache Beam and run them as both batch and streaming jobs on Cloud Dataflow.
Google Cloud Pub/Sub is a message queuing service that allows you to deploy topics and attach subscriptions to them. Once a message is published to a topic, Pub/Sub delivers it to all of the subscriptions attached to that topic.
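The fan-out behavior described above can be sketched with a small in-memory model. This is not the Pub/Sub client library: `ToyBroker`, its methods, and the topic and subscription names are all hypothetical, and the sketch ignores acknowledgements, retries, and ordering.

```python
from collections import defaultdict


class ToyBroker:
    """In-memory stand-in for a topic/subscription broker (not the Pub/Sub API)."""

    def __init__(self):
        # topic name -> {subscription name -> list of pending messages}
        self._topics = defaultdict(dict)

    def create_subscription(self, topic, subscription):
        # Each subscription gets its own independent queue.
        self._topics[topic][subscription] = []

    def publish(self, topic, message):
        # Fan out: append the message to every subscription on the topic.
        for queue in self._topics[topic].values():
            queue.append(message)

    def pull(self, topic, subscription):
        # Return and clear the pending messages for one subscription.
        queue = self._topics[topic][subscription]
        messages = list(queue)
        queue.clear()
        return messages


broker = ToyBroker()
broker.create_subscription("orders", "billing")
broker.create_subscription("orders", "shipping")
broker.publish("orders", "order-1001")

# Both subscriptions receive their own copy of the same message.
billing_messages = broker.pull("orders", "billing")
shipping_messages = broker.pull("orders", "shipping")
print(billing_messages, shipping_messages)
```

Because each subscription keeps its own queue, one consumer pulling a message does not remove it from the others.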
In this course, you'll learn how to run Hadoop and Spark jobs on GCP.
In this course, you'll learn which of your applications could make use of Bigtable and how to take advantage of its high performance.
In this lab, you will create and manage a Redis instance by using Google Cloud Memorystore.
This short video lists some of the other resources you should review before taking the Google Certified Professional Data Engineer exam.
Guy launched his first training website in 1995 and he's been helping people learn IT technologies ever since. He has been a sysadmin, instructor, sales engineer, IT manager, and entrepreneur. In his most recent venture, he founded and led a cloud-based training infrastructure company that provided virtual labs for some of the largest software vendors in the world. Guy’s passion is making complex technology easy to understand. His activities outside of work have included riding an elephant and skydiving (although not at the same time).