Data Sources and Formats

Beginner

This lesson covers the common data formats and interfaces that you'll encounter as a data engineer. The goal is to develop a deep understanding of the pros and cons of storing your data in different ways. We then translate that high-level concept into something more concrete, showing how the same dataset can be accessed and viewed differently simply by storing it in a different format.
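As a rough illustration of that last point, the sketch below writes the same small dataset as CSV and as JSON, then reads each one back. The records and the helper names `to_csv` and `to_json` are invented for this example and are not taken from the lesson; only the Python standard library is assumed.

```python
import csv
import io
import json

# Hypothetical sample records, invented purely for illustration.
records = [
    {"id": 1, "name": "Ada", "city": "London"},
    {"id": 2, "name": "Grace", "city": "New York"},
]


def to_csv(rows):
    """Serialize a list of flat dicts to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=list(rows[0].keys()), lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def to_json(rows):
    """Serialize the same rows to JSON text."""
    return json.dumps(rows, indent=2)


csv_text = to_csv(records)
json_text = to_json(records)

# Read both representations back into Python objects.
csv_rows = list(csv.DictReader(io.StringIO(csv_text)))
json_rows = json.loads(json_text)

# CSV carries no type information, so every value comes back as a string.
print(csv_rows[0]["id"], type(csv_rows[0]["id"]).__name__)    # 1 str
# JSON distinguishes numbers from strings, so the integer survives.
print(json_rows[0]["id"], type(json_rows[0]["id"]).__name__)  # 1 int
```

The data is identical in both cases, but the storage format changes what the reader gets back: CSV is compact and tabular yet untyped, while JSON keeps basic types and can nest, at the cost of repeating field names in every record.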

If you have any feedback relating to this lesson, please contact us at support@cloudacademy.com.

Learning Objectives

Learn about different data sources and formats, and how to model your data
Get acquainted with the common data formats (CSV, XML, and JSON) as well as specialized data formats
Learn about databases and how to exchange data between applications

Intended Audience

This lesson is suited to anyone looking to gain a practical, hands-on understanding of data modeling, and to those who might want to change how they store their data.

Prerequisites

To get the most out of this lesson, you should be familiar with what CSV and JSON are, and with databases at a high level.

About the Author

Calculated Systems

Calculated Systems was founded by experts in Hadoop, Google Cloud, and AWS. Calculated Systems enables code-free capture, mapping, and transformation of data in the cloud based on Apache NiFi, an open-source project originally developed within the NSA. Calculated Systems accelerates time to market for new innovations while maintaining data integrity. With cloud automation tools, deep industry expertise, and experience productionalizing workloads, development cycles are cut down to a fraction of their normal time. The ability to quickly develop large-scale data ingestion and processing decreases the risk companies face in long development cycles. Calculated Systems is one of the industry leaders in Big Data transformation and in education on these complex technologies.

Covered Topics

Data Engineering

Databases
