Understanding the Role of Data Manipulation Skills

Learn how to describe and analyze datasets using pandas.

In practical situations, we rarely have our data in the format that we want. We usually have different datasets that we want to merge, and often, we need to normalize and clean up the data. For these reasons, data manipulation and preparation will always play a big part in any data visualization process. So, we will be focusing on this in this chapter and throughout the course.

The plan for preparing our dataset is roughly the following:

  • Explore the different files one by one.
  • Check the available data and data types and explore how each can help us categorize and analyze the data.
  • Reshape the data where required.
  • Combine different DataFrames to add more ways to describe our data.

Let’s go through these steps right away.

Exploring the data files

We start by reading in the files in the data folder.

Get hands-on with 1200+ tech skills courses.