Fundamentals of Machine Learning: A Pythonic Introduction/

...

K-Means Clustering

Learn about the k-means algorithm, its initialization, NP-hardness, and variance computation with examples.

We'll cover the following...

Objective
- Variance computation code example
K-means is NP-hard
Lloyd’s algorithm
K-means++
K-means variants
- K-medoids clustering
- K-center clustering

Traditionally, in machine learning, we start with the popular partitional clustering algorithm called $k$ -means clustering. This algorithm divides the data into $k$ clusters based on a similarity score. The objective is to minimize the total variance of the $k$ clusters. The number of clusters, $k$ , must be specified.

Note: The choice of similarity score is a hyperparameter.

Objective

Given a set $D=\{\bold x_1, \bold x_2, \dots, \bold x_n \}$ of $n$ data points in $\R^d$ , the goal is to partition $D$ into the given $k \ge 2$ partitions, say, $C_1, C_2, \dots, C_k$ ...

Course Overview

Supervised Learning

Detect Cyber Intrusion Using Machine Learning

Clustering

Project: Bag of Visual Words

Generalized Linear Regression

Face Recognition Using Kernel Linear Discriminant

Support Vector Machine

Logistic Regression

Ensemble Learning

Early Stage Diabetes Prediction Using Ensemble Learning

Decoding Dimensions: PCA and Autoencoders

Image Reconstruction Using PCA

Image Colorization using Autoencoders

Colorful Face Generation with VAEs

Appendix

Wrapping Up

How to Predict the Traffic Volume Using Machine Learning

K-Means Clustering

Objective