K-Means on n-Dimensional Data

This lesson will focus on K-Means on n-dimensional data in Python.

We'll cover the following

K-means on n-dimensional data

For clustering n-dimensional data, we can use the same procedure. The only difference is that we first need to reduce the number of dimensions to 2. For reducing dimensions, we will use a technique called Principal Component Analysis (PCA). We do not need to go into the details of how this technique works since the details are out of the scope of this course. We can reduce data to 2 dimensions by using the PCA class available in sklearn.decomposition. We will cluster the reduced data.

We will be using the Default of Credit Card Dataset. This dataset originally had two classes. Naturally, if we do not have the class labels, we would make two clusters for this example.

Get hands-on with 1200+ tech skills courses.