Calibration of Predicted Probabilities

Explore how to measure and improve the calibration of predicted probabilities in classification models. Understand expected calibration error, create decile bins, and visualize calibration plots to assess model accuracy and reliability for decision-making.

We'll cover the following...

Analyzing model calibration and accuracy
Try it yourself

Analyzing model calibration and accuracy

One interesting feature of the previous lesson's figureCaption: Plot of default rate and sample count for equal-interval bins is that the line plot of default rates increases by roughly the same amount from bin to bin. Contrast this to the decile plot in the previous lessonCaption: Default rate according to model prediction decile, where the default rate increases slowly at first and then more rapidly. Notice also that the default rate appears to be roughly the midpoint of the edges of predicted probability for each bin. This implies that the default rate is similar to the average model prediction in each bin. In other words, not only does our model appear to effectively rank borrowers from low to high risk of default, as quantified by the ROC AUC, but it also appears to accurately predict the probability of default.

Measuring how closely predicted probabilities match actual probabilities is the goal of calibrating probabilities. A standard measure for probability calibration follows from the concepts discussed above and is called expected calibration error (ECE), defined as

1.Introduction

2.Data Exploration and Cleaning

Mini Project

3.Introduction to scikit-learn and Model Evaluation

Project

Mini Project

4.Details of Logistic Regression and Feature Extraction

Mini Project

5.The Bias-Variance Trade-Off

Mini Project

6.Decision Trees and Random Forests

Mini Project

7.Gradient Boosting, XGBoost, and SHAP Values

Mini Project

Project

8.Test Set Analysis, Financial Insights, and Delivery to the Client

Mini Project

9.Appendix

Calibration of Predicted Probabilities

Analyzing model calibration and accuracy