We’ll begin exploring how to summarize our data in various ways, including calculating new variables as necessary. We’ve done something like this already, so hopefully, this serves as a refresher, reinforcement, and expansion of material that was introduced in earlier chapters.

Calculating treatment means is one of the most common things a scientist may need to do. This is useful for plotting and also for finding the average values across different categories of data. What’s the average effect of our experimental treatments? How tall are the plants in each species we’ve collected? What’s the level of expression for each of the genes in our RNA-seq dataset? These are the sorts of things we can answer once we summarize our data in some way.

Let’s start by calculating the mean age at emergence, in terms of days post-oviposition, for each of our predation treatments. This code has three steps:

We begin by declaring our original data frame, RxP.clean.
We set our grouping variable, Pred.
We define the new variable, Mean.Age.DPO, to calculate the mean using mean(Age.DPO).

Note that we pipe one line to the next using the %>% function at each step of the process.

Introduction to R

Thoughts on Proper Data Analysis

Exploratory Data Analysis and Data Summarization

Introduction to Plotting

Basic Statistical Analysis Using R

More Linear Models in R

Advanced Statistical Analysis Using R

Mixed-effects Model

Advanced Data Wrangling and Plotting

Writing Loops and Functions in R

Appendix

Conclusion

Basic Data Wrangling

Group and summarize data