learning path

Wrestling with Data

Intermediate
6h 32m

In this course, we dive into the tools and techniques available for manipulating information and data sources. We then show you how to apply this knowledge to solve real-world problems.

Intended Audience

If you are trying to handle increasingly complex data sets, or want to deepen your knowledge as a professional data engineer, this course offers a practical, field-based understanding.

Learning Objectives

  • Learn to determine when it's appropriate to use a programmatic approach versus pure SQL.
  • Learn how to access and manipulate your files and data sources using programming techniques available in languages such as Python.
  • Learn how to use regular expressions to manipulate data and solve common data issues.
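As a rough illustration of the kind of task these objectives describe, here is a minimal Python sketch that reads CSV data, uses a regular expression to normalize inconsistently formatted phone numbers, and writes the result as JSON. It is not taken from the course material: the sample data, the `normalize_phone` and `clean_rows` helpers, and the ten-digit output format are all assumptions chosen for the example.

```python
import csv
import io
import json
import re

# Hypothetical sample data; a real job would read from a file or data source.
RAW_CSV = """name,phone
Ada Lovelace,(555) 010-1234
Grace Hopper,555.010.5678
Alan Turing,555 010 9999
"""

# Matches any run of characters that are not digits.
NON_DIGITS = re.compile(r"\D+")


def normalize_phone(raw):
    """Strip non-digits and format the first ten digits as NNN-NNN-NNNN.

    Returns None when fewer than ten digits are present.
    """
    digits = NON_DIGITS.sub("", raw)
    if len(digits) < 10:
        return None
    return f"{digits[0:3]}-{digits[3:6]}-{digits[6:10]}"


def clean_rows(csv_text):
    """Parse CSV text and return a list of cleaned row dictionaries."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [
        {"name": row["name"].strip(), "phone": normalize_phone(row["phone"])}
        for row in reader
    ]


cleaned = clean_rows(RAW_CSV)
print(json.dumps(cleaned, indent=2))
```

This sort of per-value pattern cleanup is often easier to express and test in a programming language than in pure SQL, although the right choice depends on where the data lives and how large it is.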

Prerequisites

  • Familiarity with relational databases and other data formats such as CSV and JSON.
  • Baseline understanding of SQL.

If you don't meet all of these prerequisites, this course will still benefit you, but you might not be able to follow all of the examples.

If you have any feedback relating to this content, feel free to reach out to us at support@cloudacademy.com.

This content is developed in partnership with Calculated Systems
About the Author
Calculated Systems
Training Provider
Students: 31,457
Labs: 31
Courses: 13
Learning paths: 42

Calculated Systems was founded by experts in Hadoop, Google Cloud, and AWS. It enables code-free capture, mapping, and transformation of data in the cloud based on Apache NiFi, an open source project originally developed within the NSA. Calculated Systems accelerates time to market for new innovations while maintaining data integrity. With cloud automation tools, deep industry expertise, and experience productionizing workloads, development cycles are cut to a fraction of their normal time. The ability to quickly develop large-scale data ingestion and processing decreases the risk companies face in long development cycles. Calculated Systems is one of the industry leaders in Big Data transformation and in education on these complex technologies.

Covered Topics