Sessionizing Clickstream Data with Kinesis Data Analytics
The hands-on lab is part of this learning path
Ready for the real environment experience?
Description
Kinesis Data Analytics is a fully-managed service that enables you to perform analysis using SQL and other tools on streaming data in real-time. Kinesis Data Analytics scales automatically to match your usage, there's no infrastructure to manage and you only pay for what you use.
Use-cases for Kinesis Data Analytics include:
- Streaming extract, transform and load (ETL) jobs
- Real-time log analysis
- Ad-tech and digital marketing analysis
Kinesis Data Analytics allows you to make use of existing and familiar SQL skills, it also integrates with other AWS services. You can deliver your results to any destination supported by Kinesis Data Streams or Kinesis Firehose, and use a Lambda function to deliver to external or un-managed destinations.
In this lab, you will learn how to use Kinesis Data Analytics to sessionize sample clickstream data and output it to DynamoDB using a Lambda.
Learning Objectives
This is a beginner-level lab, upon completion, you will be able to:
- Use Kinesis Data Analytics to analyze clickstream data
- Create a Lambda function that adds records to a DynamoDB table
- Configure Kinesis Data Analytics to send results to your Lambda function
Intended Audience
- Candidates for the AWS Certified Data Analytics Speciality exam
- Data Engineers
- Cloud Engineers
Prerequisites
Familiarity with Data Analytics, SQL, the Bash shell, and the Python programming language will be beneficial but is not required.
The following courses can be used to fulfill the prerequisites:
Andrew is a Labs Developer with previous experience in the Internet Service Provider, Audio Streaming, and CryptoCurrency industries. He has also been a DevOps Engineer and enjoys working with CI/CD and Kubernetes.
He holds the AWS Certified Developer - Associate and AWS Certified Sysops Administrator - Associate certifications.