Search⌘ K
AI Features

Clustering with PyCaret

Explore clustering methods using PyCaret, focusing on K-Means and other algorithms to segment datasets by shared features. Understand the setup, synthetic dataset generation, and model evaluation to build a foundational clustering skillset for unsupervised machine learning tasks.

One of the fundamental tasks in unsupervised machine learning is clustering. This task aims to categorize instances of a given dataset in different clusters based on their common characteristics. Clustering has many practical applications in various fields such as market research, social network analysis, bioinformatics, medicine, and others. The k-means clustering method is a simple and widely used method. It is defined in the following formula:

minC1,,CK{k=1KW(Ck)}\min_{C_{1}, \ldots, C_{K}}\left\{\sum_{k=1}^{K} W(C_{k})\right\}

KK is the number of all clusters, while CkC_{k} represents each cluster. Our goal is to minimize ...