Aggregating Data

This lesson introduces us to aggregation of data and focuses on how we can use entire columns to aggregate using Pandas.

Aggregation

Aggregation is the procedure of converting a large number of values, or a dataset, into a single value or quantity aimed to summarize or describe the data. During data analysis, we always want to summarize data in one way or another. When we take a sum, count the number of items, or take the average of some values we aggregate data. Common aggregation methods are:

  • sum
  • count
  • maximum
  • minimum
  • average

Aggregation is an essential step in analyzing data as it tells us the nature of the data in a single quantity. Let’s look at examples on our California Census Housing Dataset to see how this is true.

Get hands-on with 1400+ tech skills courses.