Aggregating Data

This lesson introduces data aggregation and focuses on how we can aggregate entire columns using Pandas.

Aggregation

Aggregation is the process of reducing a large number of values, or an entire dataset, to a single value that summarizes or describes the data. During data analysis, we often want to summarize data in one way or another. When we take a sum, count the number of items, or take the average of some values, we are aggregating data. Common aggregation methods are:

  • sum
  • count
  • maximum
  • minimum
  • average
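
As a minimal sketch of the methods listed above, the snippet below applies each one to a whole column of a Pandas DataFrame. The DataFrame, its column names (`population`, `households`), and its values are made up for illustration and are not taken from the lesson's dataset.

```python
import pandas as pd

# Hypothetical sample data: names and values are illustrative only
df = pd.DataFrame({
    "population": [322, 2401, 496, 558, 565],
    "households": [126, 1138, 177, 219, 259],
})

# Select one entire column as a Series
population = df["population"]

# Each method reduces the whole column to a single value
summary = {
    "sum": population.sum(),
    "count": population.count(),    # number of non-missing values
    "maximum": population.max(),
    "minimum": population.min(),
    "average": population.mean(),
}

for name, value in summary.items():
    print(f"{name}: {value}")
```

Each call takes every value in the column and returns one number, which is what makes it an aggregation. Note that `count()` skips missing values, so it can be smaller than the number of rows.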

Aggregation is an essential step in data analysis because it describes a characteristic of the data, such as its total or its typical value, in a single quantity. Let’s look at examples on our California Census Housing Dataset to see how this is true.
