Inconsistent Data

This lesson will focus on some of the common inconsistencies present in datasets and how to deal with them using pandas.

Inconsistency in data arises due to errors in collecting data. For instance, if the data was collected from multiple sources, or if the data was collected by multiple people who did not follow the same format of collecting data, then there is a high chance of inconsistencies in the data.

In this lesson, we will be cleaning the Credit Cards Default Dataset. This dataset is a very good example of the kind of inconsistencies that are present in most datasets.

Get hands-on with 1200+ tech skills courses.