Review: Data Science Pipeline
Explore the full data science pipeline by reviewing key concepts like data wrangling with dplyr, visualization using ggplot2, regression modeling, and statistical inference with the infer package. Understand how to think and tell stories with data, reinforcing foundational skills for practical analysis in R.
We'll cover the following...
We displayed the ModernDive flowchart mapping our journey throughout the whole course.
Let’s have a refresher of what has been covered so far. We started with data in the first chapter, where we learned about the difference between R and RStudio, started coding in R, installed and loaded our first R packages, and explored our first dataset. This dataset was about all domestic departure flights from a major New York City airport in 2013. Then we covered the following three parts of this course:
Data science with
tidyverse: We assembled our data science toolbox usingtidyversepackages. In particular, we:Visualized data using the
ggplot2packageWrangled data using the
dplyr...