Cross-Validation

Explore the concept of cross-validation and understand its role in selecting hyperparameters like the regularization parameter for logistic regression. Learn how to use scikit-learn's KFold and StratifiedKFold to create training and test folds that help evaluate model performance reliably. Understand the importance of data splitting, shuffling, stratification, and how these practices reduce overfitting and improve model generalization. Gain practical skills for implementing k-fold and leave-one-out cross-validation to optimize predictive models.

Choosing the regularization parameter

By now, you may suspect that we could use regularization to decrease the overfitting we observed when we tried to model the synthetic data in Exercise: Generating and Modeling Synthetic Classification Data. The question is, how do we choose the regularization parameter C? C is an example of a model hyperparameter. Hyperparameters are different from the parameters that are estimated when a model is trained, such as the coefficients and the intercept of a logistic regression. Rather than being estimated by an automated procedure like the parameters are, hyperparameters are input directly by the user as keyword arguments, typically when instantiating the model class. So, how do we know what values to choose?
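To make the distinction concrete, here is a minimal sketch, assuming a small toy dataset made up purely for illustration: C is supplied by the user as a keyword argument, while the coefficients and intercept are estimated when fit is called.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Hypothetical toy data, for illustration only
X = np.array([[0.1, 1.2], [1.5, 0.3], [0.4, 2.1], [2.0, 0.8]])
y = np.array([0, 1, 0, 1])

# C is a hyperparameter: chosen by the user when instantiating the class
model = LogisticRegression(C=0.1)

# The coefficients and intercept are parameters: estimated during training
model.fit(X, y)
print(model.coef_, model.intercept_)
```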

Hyperparameters are more difficult to estimate than parameters. This is because it is up to the data scientist to determine what the best value is, as opposed to letting an optimization algorithm find it. However, it is possible to programmatically choose hyperparameter values, which could be viewed as an optimization procedure in its own right. Practically speaking, in the case of the regularization parameter C, this is most commonly done by fitting the model on one set of data with a particular value of C, determining the model's performance on the training data, and then assessing its out-of-sample performance on another set of data.
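The following sketch illustrates this procedure, assuming a synthetic dataset from make_classification as a stand-in for the exercise data: each candidate value of C is fit on the training set, and the in-sample and out-of-sample accuracies are compared.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Hypothetical synthetic data standing in for the exercise dataset
X, y = make_classification(n_samples=200, n_features=4, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42)

# Fit with each candidate C and compare training vs out-of-sample accuracy
for C in [0.001, 0.01, 0.1, 1, 10]:
    model = LogisticRegression(C=C)
    model.fit(X_train, y_train)
    print(f"C={C}: train={model.score(X_train, y_train):.3f}, "
          f"test={model.score(X_test, y_test):.3f}")
```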

We are already familiar with the concept of using training and test sets. However, there is a key difference here; for instance, what would happen if we were to use the test set multiple times ...
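As a preview of where this is heading, here is a minimal sketch of stratified k-fold cross-validation with scikit-learn's StratifiedKFold, again assuming a synthetic stand-in dataset: rather than reusing a single test set, each fold takes a turn as the held-out set, and the scores are averaged.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold

# Hypothetical synthetic data standing in for the exercise dataset
X, y = make_classification(n_samples=200, n_features=4, random_state=42)

# StratifiedKFold preserves the class balance in each fold;
# shuffling guards against any ordering present in the data
skf = StratifiedKFold(n_splits=4, shuffle=True, random_state=42)

scores = []
for train_idx, test_idx in skf.split(X, y):
    model = LogisticRegression(C=0.1)
    model.fit(X[train_idx], y[train_idx])
    scores.append(model.score(X[test_idx], y[test_idx]))

print(f"mean accuracy: {np.mean(scores):.3f} +/- {np.std(scores):.3f}")
```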