Building ETL Pipelines on AWS

CLOUD LABS

In this Cloud Lab, you’ll learn how to create an ETL data pipeline with AWS Glue.

8 Tasks

Intermediate

1hr 30m

Certificate of Completion

Desktop Only
No Setup Required
Amazon Web Services

Learning Objectives

A thorough understanding of AWS Glue ETL
The ability to set up a visual ETL pipeline
Hands-on experience performing ETL operations on a dataset

Technologies
DynamoDB
S3
Glue
Cloud Lab Overview

AWS Glue is a serverless data integration service that makes it easier to discover, prepare, move, and integrate data from multiple sources. It provides an ETL (extract, transform, load) service: a process used in data engineering to extract data from various sources, transform it into a desired format, and load it into a target data store for analysis, reporting, and business intelligence. Because it is serverless, AWS Glue simplifies the ETL process and makes it easier for businesses to prepare and transform their data for analytics.

In this Cloud Lab, you’ll create a DynamoDB table as the source data. You’ll set up a database in AWS Glue with the DynamoDB table as its source. After that, you’ll use an AWS Glue crawler to fetch metadata from the DynamoDB table into Data Catalog tables in the Glue database. You’ll then set up an ETL pipeline in AWS Glue that extracts data from the Glue database, performs transformations on it, and loads the resulting data into an S3 bucket.
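To make the transform-and-load step concrete before you build it visually in Glue, here is a plain-Python sketch of the kind of record-level work a Glue job performs. The field names (`order_id`, `amount`, `status`) are hypothetical, and the list of dictionaries stands in for items crawled from the DynamoDB table; a real job would read from the Data Catalog and write to S3.

```python
import json

# Hypothetical source records, standing in for items the crawler
# cataloged from the DynamoDB table (field names are illustrative).
source_items = [
    {"order_id": "101", "amount": "19.99", "status": "SHIPPED"},
    {"order_id": "102", "amount": "5.50", "status": "pending"},
]

def transform(item):
    # Example transformations a Glue job might apply:
    # cast the amount to a number and normalize the status field.
    return {
        "order_id": item["order_id"],
        "amount": float(item["amount"]),
        "status": item["status"].upper(),
    }

# "Load": serialize to JSON Lines, one common output format
# for objects written to the target S3 bucket.
output = "\n".join(json.dumps(transform(i)) for i in source_items)
print(output)
```

In the lab itself you’ll express these same ideas with Glue’s visual editor rather than hand-written code, but the underlying operations (select, cast, normalize, write) are the same.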

After the completion of this Cloud Lab, the provisioned infrastructure will be similar to the one given below:

Architecture diagram of ETL pipelines utilizing AWS Glue and S3 for data transformation and storage

What is ETL, and why does it matter?

ETL stands for Extract, Transform, Load. It’s the process of moving data from source systems into a format and destination that supports analytics, reporting, and machine learning. ETL pipelines are the foundation of most data platforms because they turn raw, messy inputs into trustworthy datasets.

Teams invest in ETL because it enables:

  • Centralized analytics and dashboards

  • Reliable reporting and governance

  • Data-driven product features

  • Machine learning pipelines that depend on clean training data

The core stages of an ETL pipeline

Most ETL pipelines, regardless of tooling, follow the same life cycle:

  • Extract: Pull data from sources like application databases, logs, SaaS tools, APIs, or file drops.

  • Transform: Clean, normalize, enrich, and validate data. This can include schema mapping, deduplication, joins, and business-rule logic.

  • Load: Write transformed data to a destination like a data warehouse, data lake, or operational store where it can be queried or used downstream.

How ETL fits into modern “data lake” patterns

In practice, many teams blend ETL with ELT:

  • ETL transforms data before loading it into the target.

  • ELT loads raw data first, then transforms within the warehouse/lakehouse.

Both approaches can be valid. The right choice depends on data size, transformation complexity, governance needs, and where you want compute to run.
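The ELT variant is easy to see in miniature: load the raw data untouched, then run the transformation where the data already lives. In this sketch, an in-memory SQLite database stands in for a warehouse such as Redshift, and the table and column names are illustrative.

```python
import sqlite3

# ELT sketch: land raw rows first, then transform inside the "warehouse"
# (sqlite3 stands in for Redshift or a lakehouse engine).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE raw_orders (order_id TEXT, amount TEXT)")
conn.executemany(
    "INSERT INTO raw_orders VALUES (?, ?)",
    [("101", "19.99"), ("102", "5.50"), ("102", "5.50")],  # includes a duplicate
)

# The transform step runs where the data lives: cast and deduplicate in SQL.
conn.execute("""
    CREATE TABLE orders AS
    SELECT DISTINCT order_id, CAST(amount AS REAL) AS amount
    FROM raw_orders
""")
rows = conn.execute(
    "SELECT order_id, amount FROM orders ORDER BY order_id"
).fetchall()
print(rows)  # [('101', 19.99), ('102', 5.5)]
```

Note where the compute runs: in ETL it runs in the pipeline engine (Glue, EMR) before the write; in ELT it runs in the destination after the write.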

What makes ETL pipelines reliable in production

A pipeline that “works once” isn’t the goal. Reliable ETL systems need:

  • Orchestration: Scheduling, dependency management, and retries

  • Idempotency: Re-running a job shouldn’t corrupt data

  • Monitoring: Visibility into failures, latency, and data freshness

  • Data quality checks: Schema validation and anomaly detection

  • Cost control: Efficient processing and storage choices
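As one concrete example of the list above, a data quality check can be as simple as a gate function that rejects a batch before it reaches the load step. The rules here (required fields, positive amounts) are purely illustrative; real pipelines often use dedicated frameworks for this.

```python
def check_quality(rows, required=frozenset({"id", "amount"})):
    # Minimal data-quality gate: report rows that are missing required
    # fields or have a non-positive amount (rules are illustrative).
    problems = []
    for i, row in enumerate(rows):
        missing = required - row.keys()
        if missing:
            problems.append((i, f"missing fields: {sorted(missing)}"))
        elif row["amount"] <= 0:
            problems.append((i, "non-positive amount"))
    return problems

good = [{"id": 1, "amount": 9.5}]
bad = [{"id": 2}, {"id": 3, "amount": -1}]
print(check_quality(good))        # [] -- batch passes
print(check_quality(good + bad))  # two problems reported
```

Running checks like this before the load step turns silent data corruption into a loud, retryable failure, which is exactly what orchestration and monitoring need to work with.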

These operational concerns usually matter more than the transformation code itself.

Why AWS is commonly used for ETL

AWS offers flexible building blocks for ETL: storage, compute, orchestration, and managed data services. That flexibility lets you assemble pipelines in different ways, from fully managed ETL services to custom pipelines built on serverless or container platforms.

The key learning is architecture: how to design a pipeline that’s scalable, observable, and easy to evolve as requirements change.

Cloud Lab Tasks
1. Introduction
Getting Started
2. Set Up the Data Stores
Create a DynamoDB Table
Configure and Run a Glue Crawler
Create an S3 Bucket
3. Build the ETL Pipeline
Create a Visual ETL Pipeline with AWS Glue
Configure and Run the ETL Job
4. Conclusion
Clean Up
Wrap Up
Labs Rules Apply
Stay within resource usage requirements.
Do not engage in cryptocurrency mining.
Do not engage in or encourage activity that is illegal.

Frequently Asked Questions

What is an ETL pipeline in AWS?

An ETL pipeline extracts data from sources, transforms it into a usable format, and loads it into a data store. AWS provides managed services to automate and scale each stage.

Which AWS services are commonly used for ETL?

AWS Glue, Lambda, and EMR are popular for data transformation. S3, Redshift, and RDS are commonly used as storage and data warehouse destinations.

Is AWS Glue ETL or ELT?

AWS Glue primarily supports ETL because it transforms data before loading it into the target system. However, it can also support ELT when transformations run inside data warehouses like Redshift.

What’s the difference between ETL and ELT in AWS?

ETL transforms data before loading it into storage, often using Glue or EMR. ELT loads raw data into systems like Redshift and performs transformations inside the warehouse.

Where is data typically stored in an AWS ETL pipeline?

Raw and processed data is commonly stored in Amazon S3 as a data lake. Analytical workloads often use Amazon Redshift for querying structured data.

How are ETL pipelines scheduled and orchestrated on AWS?

AWS Step Functions and Glue Workflows help orchestrate multi-step pipelines. EventBridge can trigger jobs based on time schedules or system events.

Can AWS ETL pipelines be fully serverless?

Yes, using services like Glue, Lambda, S3, and Redshift Serverless. This eliminates infrastructure management while maintaining scalability.
