Data Modeling

Learn how to combine multiple datasets and data modeling strategies.

We'll cover the following

Data modeling is the process of combining related data from multiple sources that are related to the business use case. We often find ourselves in situations where we need to combine different datasets to extract value from them. Datasets act like pieces of a puzzle, and sometimes we need to combine them to see the entire picture.

In this lesson, we’ll cover some of the techniques often used in data modeling.

Merging

Let’s assume we have a retail business where transaction records come from a POS system and customer details are stored in a Customer Relationship Management (CRM) system. Say we’d like to perform a report where we want to see quarterly revenue per customer. In such a scenario, we must combine these two systems for a complete view. There are a couple of ways to achieve this in Pandas, including merging. Let’s see how it’s done.

Get hands-on with 1200+ tech skills courses.