In Python, pandas is an open-source library providing high-performance, easy-to-use data structures and data analysis tools. Today, pandas is actively supported by a community of like-minded individuals around the world who contribute their valuable time and energy to help make pandas open source.

In this section of the course, we’ll learn to use pandas for data analysis. To compare pandas with a familiar program, we can think of pandas as being an extremely powerful version of Excel with a lot more features. We’ll cover the following key concepts in this lesson:

  • Data structures—Series and DataFrame.
  • Indexing and selection.
  • Hierarchical indexing.
  • Data cleaning and preparation.
  • Handling missing data.
  • Data wrangling—merging/joining, combining/concatenation.
  • Data aggregation and GroupBy.

Get hands-on with 1200+ tech skills courses.