K-Means Clustering

In this lesson, you’ll learn about clustering algorithms, which form the core of unsupervised learning and involve grouping similar items together.

We'll cover the following...

Clustering
K-means clustering
How does the K-means algorithm work
Drawbacks of K-Means clustering

Clustering

Clustering is a widely used unsupervised learning technique. It groups items into clusters such that items in the same cluster are more similar to each other than to items in other clusters. In this lesson, we will look at K-means clustering.

K-means clustering

K-means clustering, as the name suggests, looks for a fixed number of clusters ($K$) in the dataset. The mean or center of the $k^{th}$ cluster is represented by $\mu_{k}$, which is also called the cluster centroid or average point. K-means relies on the idea of similarity/dissimilarity when assigning instances to their respective clusters.
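To make the roles of $K$ and $\mu_{k}$ concrete, here is a minimal sketch of a single assign-and-update step. It assumes Euclidean distance as the dissimilarity measure, and the function names, sample points, and starting centroids are hypothetical, chosen only for illustration.

```python
import math

def nearest_centroid(point, centroids):
    """Return the index k of the centroid closest to `point`.

    Euclidean distance is an assumption made for this sketch.
    """
    return min(range(len(centroids)), key=lambda k: math.dist(point, centroids[k]))

def update_centroids(points, labels, k):
    """Recompute each mu_k as the mean of the points assigned to cluster k."""
    dims = len(points[0])
    centroids = []
    for cluster in range(k):
        members = [p for p, label in zip(points, labels) if label == cluster]
        if not members:
            raise ValueError(f"cluster {cluster} has no points")
        centroids.append(
            tuple(sum(p[d] for p in members) / len(members) for d in range(dims))
        )
    return centroids

# Hypothetical toy data with K = 2 and hand-picked starting centroids.
points = [(1.0, 1.0), (1.0, 2.0), (8.0, 8.0), (9.0, 8.0)]
centroids = [(0.0, 0.0), (10.0, 10.0)]

labels = [nearest_centroid(p, centroids) for p in points]  # assignment step
centroids = update_centroids(points, labels, k=2)          # update step
```

After this one step, the first two points share a cluster and the last two share the other, and each centroid sits at the average of its members.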

Similarity can also be thought of as proximity. It is a numerical measure of how alike two data instances are. Cosine similarity is one of the most commonly used similarity measures. In general it takes values between -1 and 1, and between 0 and 1 when all feature values are non-negative; a higher value indicates more similar instances. Cosine similarity between two feature vectors $x = (x_1, x_2, \ldots, x_n)$ and $y = (y_1, y_2, \ldots, y_n)$ ...
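As a small illustration, cosine similarity is standardly defined as the dot product of the two vectors divided by the product of their lengths, $\frac{x \cdot y}{\lVert x \rVert \, \lVert y \rVert}$. The sketch below follows that standard definition; the function name and the choice to raise an error for zero vectors are assumptions made for this example.

```python
import math

def cosine_similarity(x, y):
    """Cosine of the angle between two equal-length feature vectors."""
    if len(x) != len(y):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    if norm_x == 0 or norm_y == 0:
        # The ratio is undefined when either vector has zero length.
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_x * norm_y)
```

Vectors that point in the same direction, such as `[1, 2]` and `[2, 4]`, score close to 1 regardless of their magnitudes, while perpendicular vectors such as `[1, 0]` and `[0, 1]` score 0.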
