The Basics of Data Management, Data Manipulation and Data Modelling

Developed with Calculated Systems
Calculated Systems
This content is developed in partnership with Calculated Systems
DifficultyBeginner
AVG Duration7h
Students2051
Ratings
5/5
starstarstarstarstar
Content
213

Description

Take this learning path to get the basics of everything data-related. Learn about data sources, data formats, databases, and SQL.

This learning path focuses on understanding common data formats and interfaces. It explores some common data formats that you'll encounter as a data engineer and you'll get a deep understanding of the pros and cons of storing your data in different ways. You'll also learn how data sets can be accessed and viewed differently if you were to just simply store it in a different fashion. After that, we'll move on to SQL, starting with a brief overview and history of the SQL standard. You'll then learn how to read and write data using structures created with the language SQL. 

This learning path comes complete with hands-on labs so that you can put your newly-acquired knowledge into practice, and also includes a final exam, so you can assess your understanding of the topics covered.

If you have any feedback relating to this learning path, please contact us at support@cloudacademy.com

Learning Objectives

  • Learn about different data sources and formats, and how to model your data
  • Get acquainted with the common data formats — CSV, XML, and JSON — as well as specialized data formats
  • Learn about databases and how to exchange data between applications
  • Recognize and explain the structured query language
  • Understand how to read and write SQL commands

Intended Audience

  • Aspiring data engineers
  • Anyone looking to gain a practical, hands-on understanding of data modeling and for those who might want to change how they're storing their data

Prerequisites

To get the most out of this learning path, you should have a basic understanding of computing services and a high-level understanding of databases. Knowledge of what a CSV and a JSON are would also be beneficial.

Certificate

Your certificate for this learning path

Training Content

1
Course - Beginner - 35m
Working With Data Sets
This course focuses on understanding common data formats and interfaces. It explores some common data formats that you'll encounter as a data engineer.
2
Hands-on Lab - Beginner - 1h
Acquiring and Storing Data in Python
This lab is aimed at students, with a moderate understanding of Python, who want to understand how to query an API, manipulate the data and store that data into a database with a more advanced schema.
3
Course - Beginner - 1h 4m
Introduction to SQL
In this Course, you will learn how to read and write data using structures created with the language SQL.
4
Hands-on Lab - Beginner - 8h
SQL Language
In this lab, you will complete a series of exercises that cover all the important components of the language and allow you to practice your SQL.
5
Hands-on Lab - Beginner - 1h
Run Your First SQL Queries
In this lab, you will practice the basics of SQL by connecting to a Microsoft SQL Server instance and then creating, managing and viewing data using a combination of user interface tools and SQL queries.
6
Exam - 20m
Knowledge Check: Introduction to Databases - Working with Datasets
Knowledge Check: Introduction to Databases - Working with Datasets
About the Author
Students22804
Labs31
Courses13
Learning paths35

Calculated Systems was founded by experts in Hadoop, Google Cloud and AWS. Calculated Systems enables code-free capture, mapping and transformation of data in the cloud based on Apache NiFi, an open source project originally developed within the NSA. Calculated Systems accelerates time to market for new innovations while maintaining data integrity.  With cloud automation tools, deep industry expertise, and experience productionalizing workloads development cycles are cut down to a fraction of their normal time. The ability to quickly develop large scale data ingestion and processing  decreases the risk companies face in long development cycles. Calculated Systems is one of the industry leaders in Big Data transformation and education of these complex technologies.