Random Forests

This lesson will focus on training random forest models in Python.

Random Forests

Random Forest, as the name implies, consists of a large number of decision trees that operate as an ensemble for classification. An ensemble is a collection of different predictive models that collectively decide the predicted output. In random forests, each individual tree gives a class as an output. The class with the most votes gets chosen as the final output of the model.

Picture courtesy of towardsdatascience.com

In the above example, six trees predict class 1 while three predict class 0. Since class 1 has more votes, the final prediction is 1.
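As a rough illustration of this voting scheme, the sketch below fits scikit-learn's RandomForestClassifier on a small synthetic dataset and compares the individual trees' votes with the forest's final prediction. The dataset, the nine-tree setup, and the variable names are assumptions made for demonstration, not part of the lesson.

# A minimal sketch of majority voting in a random forest, assuming
# scikit-learn is available and using a synthetic dataset for illustration.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

X, y = make_classification(n_samples=200, n_features=8, random_state=0)

# Nine trees, to mirror the picture above.
forest = RandomForestClassifier(n_estimators=9, random_state=0)
forest.fit(X, y)

sample = X[:1]  # a single observation to classify

# Each fitted tree in the ensemble casts a vote for a class.
tree_votes = [int(forest.classes_[int(tree.predict(sample)[0])])
              for tree in forest.estimators_]
print("Individual tree votes:", tree_votes)

# The forest aggregates the trees. (scikit-learn averages the trees'
# class probabilities, which for fully grown trees usually matches the
# majority vote shown above.)
print("Forest prediction:", int(forest.predict(sample)[0]))

Note that scikit-learn technically performs soft voting (averaging the trees' predicted class probabilities) rather than counting hard votes, but the two typically agree.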

How do Random Forests work?

The idea behind random forests is that a large number of uncorrelated decision trees working individually will perform better as a committee than any individual tree.

There are two important ideas in the above statement.

  • Uncorrelated trees
  • Performing as a committee
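Before looking at each idea in turn, here is a quick, illustrative check of the committee claim: comparing a single decision tree against a random forest using cross-validation. The dataset and parameter values are assumptions made for demonstration only; exact scores will vary, but the forest usually comes out ahead.

# A rough comparison of one decision tree versus a random forest,
# assuming scikit-learn and a synthetic classification dataset.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=1000, n_features=20, n_informative=5,
                           random_state=0)

single_tree = DecisionTreeClassifier(random_state=0)
forest = RandomForestClassifier(n_estimators=100, random_state=0)

# 5-fold cross-validated accuracy for each model.
print("Single tree accuracy:  ", cross_val_score(single_tree, X, y, cv=5).mean())
print("Random forest accuracy:", cross_val_score(forest, X, y, cv=5).mean())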

Uncorrelated trees

For a random forest model to perform well, the individual decision trees need to have low correlation amongst themselves. Just as investments with low correlations, such as stocks and bonds, combine to form a portfolio greater than the sum of its parts, a random forest can produce predictions better than those of any of its individual trees.
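In scikit-learn, this low correlation comes from two sources of randomness: each tree is trained on a bootstrap sample of the rows, and each split considers only a random subset of the features. The sketch below shows the corresponding constructor parameters; the specific values are illustrative assumptions rather than recommendations.

# A minimal sketch of the parameters that keep the trees loosely
# correlated, assuming scikit-learn's RandomForestClassifier.
from sklearn.ensemble import RandomForestClassifier

forest = RandomForestClassifier(
    n_estimators=100,     # number of trees in the committee
    bootstrap=True,       # each tree sees a different bootstrap sample of rows
    max_features="sqrt",  # each split considers a random subset of features
    random_state=0,
)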
