There are many tools in R to help us summarize our data efficiently. There are two functions worth knowing about at this stage:

The aggregate() function from base R.
The summarize() function from the dplyr package.

The `aggregate()` function

The aggregate() function takes a column of raw data and summarizes it across one or more groups based on some chosen function—for example, calculating the mean or the standard deviation. One nice thing about using aggregate() is that we code the function of how we want our data summarized in the exact same format that we use for specifying plots or models. The output of the aggregate() function is a data frame, which is then easy to use for plotting figures or other purposes. The aggregate() function takes three arguments:

It takes the response and predictor variables.
It takes the function we want to execute with the FUN= argument.
It takes the data frame where the data can be found.

For example, if we want to calculate the mean size of metamorphs at emergence across all combinations of predator and resource treatments, we type the following:

Introduction to R

Thoughts on Proper Data Analysis

Exploratory Data Analysis and Data Summarization

Introduction to Plotting

Basic Statistical Analysis Using R

More Linear Models in R

Advanced Statistical Analysis Using R

Mixed-effects Model

Advanced Data Wrangling and Plotting

Writing Loops and Functions in R

Appendix

Conclusion

Summarizing and Manipulating Data

The `aggregate()` function

Introduction to R

Thoughts on Proper Data Analysis

Exploratory Data Analysis and Data Summarization

Introduction to Plotting

Basic Statistical Analysis Using R

More Linear Models in R

Advanced Statistical Analysis Using R

Mixed-effects Model

Advanced Data Wrangling and Plotting

Writing Loops and Functions in R

Appendix

Conclusion

Summarizing and Manipulating Data

The aggregate() function

The `aggregate()` function