Transform Raw Data to Transactional S3 Tables with Amazon Athena

Transform Raw Data to Transactional S3 Tables with Amazon Athena
Transform Raw Data to Transactional S3 Tables with Amazon Athena

CLOUD LABS



Transform Raw Data to Transactional S3 Tables with Amazon Athena

In this Cloud Lab, you’ll transform raw data stored in Amazon S3 into transactional S3 Tables using Apache Iceberg. You’ll define schemas, load and query data with Amazon Athena, and perform inserts, updates, deletes, and merges while exploring Iceberg snapshots and time travel.

8 Tasks

beginner

1hr 30m

Certificate of Completion

Desktop OnlyDevice is not compatible.
No Setup Required
Amazon Web Services

Learning Objectives

Working knowledge of creating and managing Iceberg tables in Amazon S3
An understanding of querying Iceberg tables using Amazon Athena SQL
Familiarity with performing inserts, updates, deletes, and merges on Iceberg tables
Experience with using partitioning to optimize query performance
Knowledge of Iceberg snapshots and time travel capabilities

Technologies
S3 logoS3
Athena
Cloud Lab Overview

In this Cloud Lab, you’ll learn how modern data platforms evolve beyond basic file storage to support transactional, structured, and queryable datasets. You’ll use Amazon S3 paired with Apache Iceberg to enable schema evolution, ACID transactions, and high-performance analytics directly on object storage. With Amazon Athena, you’ll run SQL queries without managing servers or infrastructure, making it easy to explore and analyze your Iceberg tables.

You’ll begin by transforming raw S3 data into Iceberg-backed tables and defining their schemas. You’ll then use Athena SQL to load and query the data and perform inserts, updates, deletes, and merges. Finally, you’ll explore Iceberg’s snapshot and time travel capabilities to track historical changes and compare past versions of your datasets. Together, these skills will help you build governed, analytics-ready data lakes on AWS using scalable, open table formats.

The following is the high-level architecture diagram of the infrastructure you’ll create in this Cloud Lab:

Transactional S3 Table with Amazon Athena
Transactional S3 Table with Amazon Athena
Cloud Lab Tasks
1.Introduction
Getting Started
2.Prerequisites
Create S3 Buckets
3.Prepare Data Using Athena
Query the S3 Data with Athena
Configure S3 Table Bucket
4.ACID Operations on S3 Tables
DML Queries on S3 Tables
Time Travel and Snapshot Comparison
5.Conclusion
Clean Up
Wrap Up
Labs Rules Apply
Stay within resource usage requirements.
Do not engage in cryptocurrency mining.
Do not engage in or encourage activity that is illegal.
Hear what others have to say
Join 1.4 million developers working at companies like