Real-Time Anomaly Detection with Data Firehose and SageMaker

Real-Time Anomaly Detection with Data Firehose and SageMaker
Real-Time Anomaly Detection with Data Firehose and SageMaker

CLOUD LABS



Real-Time Anomaly Detection with Data Firehose and SageMaker

In this Cloud Lab, you’ll learn to build a real-time anomaly detection system using dynamic partitioning in Data Firehose and SageMaker.

8 Tasks

intermediate

1hr 30m

Certificate of Completion

Desktop OnlyDevice is not compatible.
No Setup Required
Amazon Web Services

Learning Objectives

An understanding of S3 dynamic partitioning in Data Firehose
Hands-on experience building a real-time anomaly detection system
Working knowledge of real-time stream ingestion using Data Firehose
Hands-on experience training an anomaly detection model using Random Cut Forest in SageMaker

Technologies
SageMaker
Lambda logoLambda
SNS logoSNS
Cloud Lab Overview

Amazon Data Firehose, formerly Kinesis Firehose, is used for real-time data streaming and enables dynamic partitioning. This approach automatically routes incoming data to S3 prefixes based on record attributes, improving data organization and significantly reducing downstream analytics costs and time.

In this Cloud Lab, you’ll build a real-time anomaly detection system using Data Firehose. You’ll start by training a model for anomaly detection using SageMaker. Then, you’ll set up a Data Firehose stream to ingest sensor data and store it in an S3 bucket. Moving on, you’ll create an SNS topic for sending email alert notifications in case an anomaly is detected. Finally, you’ll combine the application by creating the Lambda function, which is triggered when new data is added to the S3 bucket.

As you complete this lab, you’ll be well equipped to implement S3 dynamic partitioning in your applications, effectively organizing the data in S3 and reducing the costs of analytical queries.

The following is the high-level architecture diagram that you will build in this Cloud Lab:

Real-time sensor anomaly detection system using Amazon Data Firehose
Real-time sensor anomaly detection system using Amazon Data Firehose

Cloud Lab Tasks
1.Introduction
Getting Started
2.Train the ML Model
Create an S3 Bucket
Create a SageMaker Notebook
3.Set Up the Firehose Stream
Create the Firehose Stream
4.Put It All Together
Set Up an SNS Topic
Create the Lambda Function
5.Conclusion
Clean Up
Wrap Up
Labs Rules Apply
Stay within resource usage requirements.
Do not engage in cryptocurrency mining.
Do not engage in or encourage activity that is illegal.

Relevant Courses

Use the following content to review prerequisites or explore specific concepts in detail.

Hear what others have to say
Join 1.4 million developers working at companies like