Introduction to Grouping

In this lesson, you will be introduced to the concept of grouping related items in Pandas.

Concept

The ability to group or segment DataFrames by one or more columns is one of the key features of any data analysis application. Therefore it would most likely show up in a data analysis interview or task.

The idea is to divide a DataFrame into multiple groups to analyze each group separately.

Syntax

As a reminder, the syntax is as simple as df.groupby(<col_name>) or, in the case of grouping by multiple columns, df.groupby([<col1>, <col2>, ..]).

Operations such as aggregations and apply functions can be applied on DataFrameGroupBy objects which can be reset to a normal DataFrame using reset_index()

Create a free account to view this lesson.

By signing up, you agree to Educative's Terms of Service and Privacy Policy