Review: Data Science Pipeline

Explore the full data science pipeline by reviewing key concepts like data wrangling with dplyr, visualization using ggplot2, regression modeling, and statistical inference with the infer package. Understand how to think and tell stories with data, reinforcing foundational skills for practical analysis in R.

We displayed the ModernDive flowchart, which maps our journey through the whole course.

ModernDive flowchart

Let’s recap what we have covered so far. We started with data in the first chapter, where we learned the difference between R and RStudio, started coding in R, installed and loaded our first R packages, and explored our first dataset. This dataset contained all domestic flights departing from one of the three major New York City airports in 2013. We then covered the following three parts of this course:

  • Data science with tidyverse: We assembled our data science toolbox using tidyverse packages. In particular, we:

    • Visualized data using the ggplot2 package

    • Wrangled data using the dplyr ...