Fundamentals of Machine Learning for Software Engineers/

...

Overfitting Explained

Learn what overfitting is, the major reasons behind overfitting, and how it differs from underfitting.

We'll cover the following...

About this chapter
The causes of overfitting
- Comparison of the loss of the two models
- Higher-dimensional data
Overfitting versus underfitting

About this chapter

Overfitting has been creating problems throughout the course. In this chapter, we’ll finally solve the problem of overfitting in our network.

To refresh our memory, a system that overfits is like a student who learns by rote memory. They might be good at solving familiar problems from textbooks, but they will struggle when confronted with new problems. Likewise, an overfitting system could be useful at classifying its training data, and then fail when classifying data it has not seen before.

One strategy to work around overfitting is to split our data into training, validation, and test sets. We should use the training set to train the system, the validation set to tune its performance, and the test set for a final check-up. That way, we can test the system on previously unseen data and get a reliable, overfitting-free measure.

That testing strategy works, but it’s short term. It does not eliminate overfitting.It just prevents overfitting from polluting our metrics. Unfortunately, overfitting has worse consequences than imprecise metrics. Like that student mentioned earlier, an overfitting system is good at memorizing but bad at generalizing. We experience such problems when we build a deep network. That network reaches perfect accuracy on the training set, but it does worse than its shallow counterpart on the validation set.

We’ll now investigate the causes of overfitting and its subtler consequences. Later on, we’ll apply a few methods to solve overfitting by regularization techniques. With those techniques, we’ll finally deliver the power of our deep neural network.

Before we talk about reducing overfitting, let’s get to know the concept of overfitting in detail. In this lesson, ...

How Machine Learning Works

Our First Learning Program

Walking the Gradient

Hyperspace

A Discern Machine

Get Real

The Final Challenge

The Perceptron

Designing the Network

Building the Network

Training the Network

How Classifiers Work

Batchin’ Up

The Zen of Testing

Let’s Do Development

A Deeper Kind of Network

Diabetes Prediction Using Keras

Defeating Overfitting

Taming Deep Networks

Beyond Vanilla Networks

Into the Deep

Recognize Handwritten Digits Using a Deep Neural Network

Machine Learning Fundamentals

Overfitting Explained

About this chapter