Sessionizing Clickstream Data with Amazon Kinesis Data Analytics

Lab Steps

lock
Logging in to the Amazon Web Services Console
lock
Creating an Amazon Kinesis Data Analytics Application
lock
Connecting to the Virtual Machine using EC2 Instance Connect
lock
Simulating a Real-Time Clickstream
lock
Sessionizing the Clickstream Data using Amazon Kinesis Data Analytics
lock
Creating an AWS Lambda function to Store Sessions in an Amazon DynamoDB Table
lock
Configuring Amazon Kinesis Data Analytics to Use Your AWS Lambda Function as a Destination

Ready for the real environment experience?

DifficultyBeginner
Time Limit1h 30m
Students1837
Ratings
4.8/5
starstarstarstarstar-half

Description

Amazon Kinesis Data Analytics is a fully-managed service that enables you to perform analysis using SQL and other tools on streaming data in real-time. Kinesis Data Analytics scales automatically to match your usage, there's no infrastructure to manage and you only pay for what you use.

Use-cases for Kinesis Data Analytics include:

  • Streaming extract, transform, and load (ETL) jobs
  • Real-time log analysis
  • Ad-tech and digital marketing analysis

Kinesis Data Analytics allows you to make use of existing and familiar SQL skills, it also integrates with other AWS services. You can deliver your results to any destination supported by Kinesis Data Streams or Kinesis Firehose, and use a Lambda function to deliver to external or un-managed destinations.

In this lab, you will learn how to use Kinesis Data Analytics to sessionize sample clickstream data and output it to DynamoDB using a Lambda.

Learning Objectives

This is a beginner-level lab, upon completion, you will be able to:

  • Use Amazon Kinesis Data Analytics to analyze clickstream data
  • Create an AWS Lambda function that adds records to an Amazon DynamoDB table
  • Configure Amazon Kinesis Data Analytics to send results to your AWS Lambda function

Intended Audience

  • Candidates for the AWS Certified Data Analytics Speciality exam
  • Data Engineers
  • Cloud Engineers

Prerequisites

Familiarity with Data Analytics, SQL, the Bash shell, and the Python programming language will be beneficial but is not required.

The following courses can be used to fulfill the prerequisites:

Updates

March 12th, 2023 - Resolved an issue that caused the lab to fail to set up on rare occasions

October 14th, 2022 - Updated instructions due to updates in the Kinesis Data Stream data retention timeframes

September 8th, 2022 - Updated the instructions and screenshots to the reflect UI

May 31st, 2022 - Updated screenshots & instructions to reflect UI

December 8th, 2021 - Updated lab step instructions to correct grammar error

December 6th, 2021 - Updated lab step instructions to reflect latest user interface changes

September 7th, 2021 - Updated lab step instructions to reflect latest user interface changes 

March 11th, 2021 - Updated AWS Lambda lab step to reflect latest user interface changes

January 22nd, 2021 - Updated AWS Lambda lab step to reflect latest user interface changes

Environment before
PREVIEW
arrow_forward
Environment after
PREVIEW
About the Author
Students54972
Labs139
Courses2
Learning paths3

Andrew is a Labs Developer with previous experience in the Internet Service Provider, Audio Streaming, and CryptoCurrency industries. He has also been a DevOps Engineer and enjoys working with CI/CD and Kubernetes.

He holds multiple AWS certifications including Solutions Architect Associate and Professional.