Transform Raw Data to Transactional S3 Tables with Amazon Athena

CLOUD LABS

Transform Raw Data to Transactional S3 Tables with Amazon Athena

In this Cloud Lab, you’ll transform raw data stored in Amazon S3 into transactional S3 Tables using Apache Iceberg. You’ll define schemas, load and query data with Amazon Athena, and perform inserts, updates, deletes, and merges while exploring Iceberg snapshots and time travel.

8 Tasks

beginner

1hr 30m

Certificate of Completion

Desktop OnlyDevice is not compatible.

No Setup Required

Amazon Web Services

Learning Objectives

Working knowledge of creating and managing Iceberg tables in Amazon S3

An understanding of querying Iceberg tables using Amazon Athena SQL

Familiarity with performing inserts, updates, deletes, and merges on Iceberg tables

Experience with using partitioning to optimize query performance

Knowledge of Iceberg snapshots and time travel capabilities

Technologies

Athena

Desktop Only

No Setup Required

Amazon Web Services

Labs Rules Apply

Stay within resource usage requirements.

Do not engage in cryptocurrency mining.

Do not engage in or encourage activity that is illegal.

Cloud Lab Overview

In this Cloud Lab, you’ll learn how modern data platforms evolve beyond basic file storage to support transactional, structured, and queryable datasets. You’ll use Amazon S3 paired with Apache Iceberg to enable schema evolution, ACID transactions, and high-performance analytics directly on object storage. With Amazon Athena, you’ll run SQL queries without managing servers or infrastructure, making it easy to explore and analyze your Iceberg tables.

You’ll begin by transforming raw S3 data into Iceberg-backed tables and defining their schemas. You’ll then use Athena SQL to load and query the data and perform inserts, updates, deletes, and merges. Finally, you’ll explore Iceberg’s snapshot and time travel capabilities to track historical changes and compare past versions of your datasets. Together, these skills will help you build governed, analytics-ready data lakes on AWS using scalable, open table formats.

The following is the high-level architecture diagram of the infrastructure you’ll create in this Cloud Lab:

Cloud Lab Tasks

1.Introduction

Getting Started

2.Prerequisites

Create S3 Buckets

3.Prepare Data Using Athena

Query the S3 Data with Athena

Configure S3 Table Bucket

4.ACID Operations on S3 Tables

DML Queries on S3 Tables

Time Travel and Snapshot Comparison

5.Conclusion

Clean Up

Wrap Up

Labs Rules Apply

Stay within resource usage requirements.

Do not engage in cryptocurrency mining.

Do not engage in or encourage activity that is illegal.

Before you start...

Try these optional labs before starting this lab.

Cloud Lab

Create a Data Lake with Lake Formation and Analyze It with Athena

intermediate

1hr

Cloud Lab

Working with AWS S3 Cross-Region Replication

beginner

1hr

Cloud Lab

Analyzing S3 Data and CloudTrail Logs Using Amazon Athena

beginner

1hr 30m

Relevant Course

Use the following content to review prerequisites or explore specific concepts in detail.

Hear what others have to say

Join 1.4 million developers working at companies like

"Your method is simple, straight to the point and I can practice with it everywhere, even from my phone, that's something I have never had in other learning platforms."

Felipe Matheus

Software Engineer

"I highly recommend Educative. The courses are well organized and easy to understand."

Adina Ong

Senior Engineering Manager

"I prefer Educative courses because they have a nice mix of text & images. I find that with full video courses, it can often be too easy to go into passive learning mode."

Clifford Fajardo

Senior Software Engineer

"I love the content on Educative and I feel as if I am definitely improving in my craft."

Thomas Chang

Software Engineer

Learn in-demand tech skills in half the time

PRODUCTS

Mock Interview

New

Courses

Skill Paths

Projects

Assessments

Newsletter

Fenzo