
Discretizing

Explore the process of discretizing continuous features with scikit-learn to create categorical bins. Learn to apply KBinsDiscretizer for uniform-width intervals and QuantileTransformer for equal-sized quantile bins. Understand how to choose the appropriate method based on data distribution to improve interpretation and computational efficiency.

Discretizing features refers to the process of converting continuous numerical features into categorical features by dividing the range of each feature into intervals, called bins. Discretization can transform continuous features into a form that is easier to visualize and interpret.
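As a minimal illustration of the idea (using NumPy's digitize rather than scikit-learn, with hypothetical age data), a continuous feature can be mapped to bin indices by choosing interval edges:

```python
import numpy as np

# A continuous feature: ages in years (example data)
ages = np.array([3, 17, 25, 42, 68, 91])

# Bin edges partition the range into intervals (bins):
# [<18], [18, 40), [40, 65), [>=65]
edges = [18, 40, 65]

# np.digitize returns the index of the bin each value falls into
bins = np.digitize(ages, edges)
print(bins)  # [0 0 1 2 3 3]
```

Each original value is replaced by the index of the interval it falls into, turning a continuous feature into a categorical one.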

Illustration of a feature being discretized into bins

In addition to potentially helping with interpretation, this technique can be used to reduce the memory and computational requirements of models, especially in resource-constrained environments, such as mobile devices or embedded systems.

The scikit-learn methods for discretizing features include KBinsDiscretizer and QuantileTransformer.

The KBinsDiscretizer method

The KBinsDiscretizer method discretizes continuous features into a specified number of bins. The following code demonstrates how to use the KBinsDiscretizer class:

import numpy as np
from sklearn.preprocessing import KBinsDiscretizer

# Define the numerical variables
X = np.array(
    [[1.9, 2.8, 6],
     [4.7, 5.6, 8],
     [0.1, 2.8, 12],
     [0.4, 8.2, 99]]
)

# Create the KBinsDiscretizer object with uniform-width bins
# (the default strategy is 'quantile', which produces
# equal-frequency rather than equal-width bins)
discretizer = KBinsDiscretizer(n_bins=3, encode='ordinal', strategy='uniform')

# Transform the numerical variables
X_discretized = discretizer.fit_transform(X)

# Print the original variables and the resulting discretized variables
print("Original:\n", X)
print("Discretized:\n", X_discretized.round(2))
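For comparison, QuantileTransformer maps each feature onto its quantile ranks, producing an approximately uniform distribution on [0, 1]; cutting those ranks into intervals yields equal-frequency bins. The following is a sketch, not a definitive recipe; the n_bins value and the flooring step are illustrative choices:

```python
import numpy as np
from sklearn.preprocessing import QuantileTransformer

# The same numerical variables as above
X = np.array(
    [[1.9, 2.8, 6],
     [4.7, 5.6, 8],
     [0.1, 2.8, 12],
     [0.4, 8.2, 99]]
)

# n_quantiles must not exceed the number of samples (4 here)
qt = QuantileTransformer(n_quantiles=4)
X_quantiled = qt.fit_transform(X)

# Each value is now its quantile rank in [0, 1]; flooring the
# scaled ranks assigns roughly equal numbers of samples per bin
n_bins = 2
X_binned = np.floor(X_quantiled * n_bins).clip(max=n_bins - 1)
print("Quantile ranks:\n", X_quantiled.round(2))
print("Equal-frequency bins:\n", X_binned)
```

Choose KBinsDiscretizer with uniform bins when the feature is roughly evenly spread; prefer quantile-based binning when the distribution is skewed (as in the third column above, where the value 99 would otherwise dominate a uniform-width split).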