CloudAcademy

Amazon Athena - Introduction

The course is part of this learning path

AWS Big Data – Specialty Certification Preparation for AWS
14 course steps, 4 lab steps, 3 quiz steps

Overview
Difficulty: Intermediate
Duration: 32m
Students: 507

Description

In this course, we will perform an in-depth review of the Amazon Athena service. We will review and explain fundamental Amazon Athena storage and querying concepts. We will highlight suitable use cases in which Athena can be applied effectively. You will be introduced to the basic underlying technology that Athena has been built on. We spend time discussing the process of creating and setting up Athena databases, tables, and partitions. We examine how Athena SQL queries are authored and how they are managed. We review current Athena limitations and pricing. Finally, we will provide a demonstration in which we publish CloudTrail logs into an S3 bucket, make some ad-hoc security group changes to generate a few CloudTrail events, and then use Athena to search for and find the captured security group API update calls.

Transcript

- [Jeremy] Hello and welcome to the Cloud Academy course on Amazon Athena. Before we start, I'd like to introduce myself. My name is Jeremy Cook. I'm one of the trainers here at Cloud Academy, specializing in AWS. Feel free to connect with either me or the wider team here at Cloud Academy regarding anything about this course. You can email us at support@cloudacademy.com. Alternatively, our online community forum is available for your feedback.

In this training course you'll be introduced to Amazon Athena. Amazon Athena is an interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL. Let's quickly review the Amazon Athena offering. It allows you to quickly query unstructured, semi-structured, and structured data stored in S3. It's a serverless service. You only pay for the data scanned. It's built on Presto, an open source distributed SQL query engine. It provides an SQL-like dialect for data querying. It's designed to provide fast performance when scanning large data sets.

The agenda for the remainder of this course is as follows. We'll review and explain fundamental Amazon Athena storage and querying concepts. We'll highlight suitable use cases in which Athena can be applied effectively. We'll review the basic underlying technology that Athena has been built on. We'll discuss and illustrate the process of creating and setting up Athena tables, formats, and partitions. We'll examine how Athena SQL queries are authored. We'll review current Athena limitations and pricing. Finally, we'll provide a demonstration in which we publish CloudTrail logs into an S3 bucket. We'll make some ad-hoc security group changes to generate a few CloudTrail events, and then we'll use Athena to search for and find the captured security group API update calls.

The following prerequisites will be helpful for this course: general big data and analytical concepts; Amazon Simple Storage Service (S3); and SQL (Structured Query Language). Furthermore, if you require an introduction to S3, then please consider taking the free S3 lab hosted here on Cloud Academy.
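
The demonstration described in the transcript, using Athena to find security group API calls in CloudTrail logs, might look roughly like the Python sketch below. It is an illustration under stated assumptions, not the query used in the course: the table name `cloudtrail_logs` is hypothetical, the column names (`eventtime`, `eventname`, `eventsource`, `sourceipaddress`, `requestparameters`) assume a CloudTrail table defined with the commonly used schema, and the list of event names is one plausible reading of "security group API update calls". The `submit_query` helper assumes `boto3` is installed and AWS credentials are configured.

```python
from typing import Iterable

# Assumption: these EC2 API calls are the "security group update calls" of interest.
SECURITY_GROUP_EVENTS = (
    "AuthorizeSecurityGroupIngress",
    "AuthorizeSecurityGroupEgress",
    "RevokeSecurityGroupIngress",
    "RevokeSecurityGroupEgress",
)


def build_security_group_query(
    table: str = "cloudtrail_logs",  # hypothetical table name
    event_names: Iterable[str] = SECURITY_GROUP_EVENTS,
    limit: int = 50,
) -> str:
    """Build a SQL string that selects security group events from a CloudTrail table."""
    # Escape single quotes so each event name is a valid SQL string literal.
    names = ", ".join("'{}'".format(n.replace("'", "''")) for n in event_names)
    # Note: the table name is interpolated as-is, so it must come from a trusted source.
    return (
        "SELECT eventtime, eventname, sourceipaddress, requestparameters\n"
        f"FROM {table}\n"
        "WHERE eventsource = 'ec2.amazonaws.com'\n"
        f"  AND eventname IN ({names})\n"
        "ORDER BY eventtime DESC\n"
        f"LIMIT {int(limit)}"
    )


def submit_query(sql: str, database: str, output_location: str) -> str:
    """Submit the query to Athena and return its execution id.

    Requires boto3 and AWS credentials; output_location is an S3 URI
    where Athena writes the query results.
    """
    import boto3  # imported here so the query builder works without boto3

    client = boto3.client("athena")
    response = client.start_query_execution(
        QueryString=sql,
        QueryExecutionContext={"Database": database},
        ResultConfiguration={"OutputLocation": output_location},
    )
    return response["QueryExecutionId"]


if __name__ == "__main__":
    print(build_security_group_query())
```

Because Athena charges for the data scanned, a real query would usually also filter on whatever partition columns the table defines, which this sketch leaves out.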

About the Author

Students: 5697
Labs: 6
Courses: 41
Learning paths: 8

Jeremy is a Cloud Researcher and Trainer at Cloud Academy where he specializes in developing technical training documentation for security, AI, and machine learning for both AWS and GCP cloud platforms.

He has a strong background in development and coding, and has been hacking with various languages, frameworks, and systems for the past 20+ years.

In recent times, Jeremy has been focused on Cloud, Security, AI, Machine Learning, DevOps, Infrastructure as Code, and CI/CD.

Jeremy holds professional certifications for both AWS and GCP platforms.