Model Performance
Evaluate model performance on a complex predictor matrix using regularization.
Moving forward, let's check how the penalties affect the performance of our models. With the (X, y) dataset, we did not see many benefits from regularization. Let's try the second dataset, (X_overfit, y_overfit), apply all three types of regularization, and compare the results to see how regularization helps us control overfitting. We'll start with linear regression without regularization.
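For context, an overfit-prone predictor matrix is typically produced by expanding a small sample into many correlated columns. The actual X_overfit and y_overfit were defined in an earlier lesson; the snippet below is only an illustrative sketch of one way such a dataset could be built, using hypothetical data and a polynomial expansion.

```python
import numpy as np
from sklearn.preprocessing import PolynomialFeatures

# Illustrative sketch only: the real X_overfit / y_overfit come from an earlier lesson.
rng = np.random.default_rng(42)
x = rng.uniform(-3, 3, size=(30, 1))                     # deliberately few samples
y_overfit = np.sin(x).ravel() + rng.normal(scale=0.3, size=30)

# Expand one input column into many correlated polynomial columns,
# giving a complex predictor matrix that is easy to overfit.
X_overfit = PolynomialFeatures(degree=15, include_bias=False).fit_transform(x)
```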
```python
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import cross_val_score

lr_model = LinearRegression()

# Five-fold cross-validated MSE (sign flipped, see note below) and R^2
lr_cv_mean_mse = -cross_val_score(estimator=lr_model, X=X_overfit, y=y_overfit,
                                  cv=5, scoring='neg_mean_squared_error').mean()
lr_cv_mean_r2 = cross_val_score(estimator=lr_model, X=X_overfit, y=y_overfit,
                                cv=5, scoring='r2').mean()

print("These are results from linear regression (cv=5) without regularization:")
print("The linear regression CV mean MSE:", lr_cv_mean_mse)
print("The linear regression CV mean R^2:", lr_cv_mean_r2)
```
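Note the leading minus sign on the MSE score: scikit-learn scorers follow a "higher is better" convention, so mean squared error is exposed as neg_mean_squared_error, and negating it recovers the familiar positive MSE.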
Now, let’s use ridge instead of linear regression.
Ridge regression
Let's use ridge regression on the same (X_overfit, y_overfit) dataset with the same five-fold cross-validation, so the scores are directly comparable to the linear regression results above.
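As a minimal sketch of what that comparison might look like, mirroring the linear regression cell above (the alpha=1.0 here is scikit-learn's default and an assumption, not a tuned value from the lesson):

```python
from sklearn.linear_model import Ridge
from sklearn.model_selection import cross_val_score

# alpha=1.0 is scikit-learn's default penalty strength; treat it as a starting point
ridge_model = Ridge(alpha=1.0)

ridge_cv_mean_mse = -cross_val_score(estimator=ridge_model, X=X_overfit, y=y_overfit,
                                     cv=5, scoring='neg_mean_squared_error').mean()
ridge_cv_mean_r2 = cross_val_score(estimator=ridge_model, X=X_overfit, y=y_overfit,
                                   cv=5, scoring='r2').mean()

print("These are results from ridge regression (cv=5):")
print("The Ridge CV mean MSE:", ridge_cv_mean_mse)
print("The Ridge CV mean R^2:", ridge_cv_mean_r2)
```

Because ridge's L2 penalty shrinks the many correlated polynomial coefficients, we would expect a lower cross-validated MSE here than in the unregularized run.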