Scalable Data Lake

Learn about data lakes and how they can be architected in AWS.

A data lake is a centralized repository for data ingested from many different sources. The term was coined around 2011 to distinguish it from other forms of centralized data storage. The term “data swamp” followed, describing a badly managed data lake.

In this lesson, we consider the AWS approach to setting up a data lake and how a data lake differs from data warehouses and production data stores.

