So far, we’ve discussed scenarios where we have categorical predictors. But what about when we have a continuous predictor? As long as our response variable is normally distributed, it’s a linear regression. As was mentioned previously, linear regression in R is just another form of lm().

For example, with linear regression, we can determine if the size at metamorphosis, SVL.final, is influenced by the length of the larval period, age.DPO. Once again, we’re going to use the log-transformed versions of these variables.

Using SVL as our response variable

We might be wondering why we’re going to use the data on the final SVL as our response variable since we saw in the previous chapter that log transformation didn’t make it normal. It improved the normality, but a Shapiro-Wilks test still said it was significantly unlikely that the data came from a normal distribution. There are two reasons why we use SVL as our response variable:

Biologically speaking, it doesn’t make sense to think that the size of the tadpole at metamorphosis affects the time it took to get to metamorphosis. Instead, we would more likely expect the relationship to go in the other causal direction.
Even if our data isn’t normal, we can always run a model and evaluate the fit using the diagnostic plots. The lm() function is highly robust, and if the model fits well, we’re in good shape. Let’s try it out!

Introduction to R

Thoughts on Proper Data Analysis

Exploratory Data Analysis and Data Summarization

Introduction to Plotting

Basic Statistical Analysis Using R

More Linear Models in R

Advanced Statistical Analysis Using R

Mixed-effects Model

Advanced Data Wrangling and Plotting

Writing Loops and Functions in R

Appendix

Conclusion

Linear Regression

Using SVL as our response variable