Understanding Data File Formats

1m 42s

This lesson explores various data file formats that are used for data analytics, big data, and machine learning. So this lesson is ideal for you if you're looking to understand which file type you should use for your big data or analytic pipelines and make a decision on which file type is right for your workload.

Learning Objectives

  • Understand the pros and cons of Apache ORC, Apache Parquet, AVRO, CSV, and JSON file types
  • Learn which data file format best suits your needs

Intended Audience

This lesson is for anyone who wants to learn about data formats and file types, and which ones are right for their workloads.


To get the most out of this lesson, you should have some background knowledge of databases, data information systems, and data files.

About the Author
Will Meadows, opens in a new tab
Senior Content Developer

William Meadows is a passionately curious human currently living in the Bay Area in California. His career has included working with lasers, teaching teenagers how to code, and creating classes about cloud technology that are taught all over the world. His dedication to completing goals and helping others is what brings meaning to his life. In his free time, he enjoys reading Reddit, playing video games, and writing books.

Covered Topics