Course Summary

Intermediate
1m 4s
886
4.9/5

In this lesson, we take a deep dive into the tools and techniques available for manipulating information and data sources, and finish by showing how to apply them to real-world problems.

If you are handling increasingly complex data sets and want to round out your experience as a professional data engineer, this lesson offers a practical, field-based understanding.

Learning Objectives

  • Learn to determine when it's appropriate to use a programmatic approach versus pure SQL.
  • Learn how to access and manipulate files and data sources using programming techniques available in languages such as Python.
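
As a rough illustration of the two objectives above, the sketch below computes the same per-customer totals twice: once with pure SQL over an in-memory SQLite table, and once programmatically in Python, where it is easy to enrich the result from a second, JSON-shaped source. The sample records, the field names, the `orders` table, and both function names are hypothetical and are not taken from the lesson; only Python standard-library modules are used.

```python
import csv
import io
import json
import sqlite3

# Hypothetical sample data; none of these names come from the lesson itself.
CSV_TEXT = "order_id,customer,amount\n1,alice,10.50\n2,bob,4.25\n3,alice,7.00\n"
JSON_TEXT = '{"alice": {"tier": "gold"}, "bob": {"tier": "basic"}}'


def totals_with_sql(csv_text):
    """Load CSV rows into an in-memory SQLite table and aggregate with SQL."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute(
            "CREATE TABLE orders (order_id INTEGER, customer TEXT, amount REAL)"
        )
        # Named placeholders are filled from each row dictionary.
        conn.executemany(
            "INSERT INTO orders VALUES (:order_id, :customer, :amount)", rows
        )
        cursor = conn.execute(
            "SELECT customer, SUM(amount) FROM orders "
            "GROUP BY customer ORDER BY customer"
        )
        return dict(cursor.fetchall())
    finally:
        conn.close()


def totals_with_python(csv_text, json_text):
    """Aggregate in Python and attach a field looked up from a JSON document."""
    profiles = json.loads(json_text)
    totals = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        name = row["customer"]
        totals[name] = totals.get(name, 0.0) + float(row["amount"])
    return {
        name: {
            "total": total,
            "tier": profiles.get(name, {}).get("tier", "unknown"),
        }
        for name, total in totals.items()
    }


if __name__ == "__main__":
    print(totals_with_sql(CSV_TEXT))
    print(totals_with_python(CSV_TEXT, JSON_TEXT))
```

One way to read the contrast: when the data is tabular and the question is an aggregate, the SQL version is shorter and lets the database do the work; once a second, differently shaped source has to be combined with it, the programmatic version may be easier to extend.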

Prerequisites

  • Familiarity with relational databases and other data formats such as CSV and JSON.
  • Baseline understanding of SQL.

If you don't have all of these, this lesson will still benefit you, but you might not be able to follow all of the examples.

About the Author
Students: 32,953
Labs: 31
Courses: 13
Learning paths: 42

Calculated Systems was founded by experts in Hadoop, Google Cloud, and AWS. Calculated Systems enables code-free capture, mapping, and transformation of data in the cloud based on Apache NiFi, an open source project originally developed within the NSA. Calculated Systems accelerates time to market for new innovations while maintaining data integrity. With cloud automation tools, deep industry expertise, and experience productionalizing workloads, development cycles are cut down to a fraction of their normal time. The ability to quickly develop large-scale data ingestion and processing decreases the risk companies face in long development cycles. Calculated Systems is one of the industry leaders in Big Data transformation and in education on these complex technologies.

Covered Topics