
Hard-Margin SVM

Learn how to implement and optimize the hard-margin SVM.

In this lesson, we explore the hard-margin SVM, a type of support vector machine designed to perfectly separate two classes with a maximum-margin hyperplane. The hard-margin SVM is suited to datasets that are linearly separable and free of outliers. We'll start by understanding what it means for a dataset to be linearly separable and the idea of maximizing the margin. Then, we'll carefully derive the equivalent optimization problem, discuss the critical role of support vectors, and implement the hard-margin SVM using cvxpy to visualize the decision boundary and the support vectors.

Linearly separable case

Given a binary classification dataset $D=\{(\bold x_1, y_1), (\bold x_2, y_2), \dots, (\bold x_n, y_n)\}$, where $\bold x_i \in \R^d$ and $y_i \in \{-1, 1\}$, if a hyperplane $\bold w^T\phi(\bold x)=0$ exists that separates the two classes, the dataset is said to be linearly separable in the feature space defined by the mapping $\phi$. We'll assume for now that the dataset $D$ is linearly separable. The goal is to find the hyperplane with a maximum margin by optimizing the following objective:

$$\max_{\bold w}\bigg[\frac{1}{\|\bold w\|}\min_{i}\big(y_i\bold w^T\phi(\bold x_i)\big)\bigg]$$
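To make the objective concrete, here is a minimal numpy sketch (not part of the lesson's code) that evaluates this quantity for one candidate hyperplane on a tiny hypothetical dataset; it assumes the bias is absorbed into $\bold w$ by taking $\phi(\bold x) = [\bold x, 1]$:

```python
import numpy as np

# Tiny hypothetical dataset with labels in {-1, +1}.
X = np.array([[2.0, 2.0], [3.0, 1.0], [-2.0, -1.0], [-1.0, -3.0]])
y = np.array([1, 1, -1, -1])

# Assumed feature map phi(x) = [x, 1]: the constant feature absorbs the bias.
Phi = np.hstack([X, np.ones((X.shape[0], 1))])

w = np.array([1.0, 1.0, 0.0])           # an arbitrary candidate hyperplane w^T phi(x) = 0
functional_margins = y * (Phi @ w)       # y_i * w^T phi(x_i), positive iff x_i is correctly classified
objective_value = functional_margins.min() / np.linalg.norm(w)
print(objective_value)                   # the quantity the SVM maximizes over w
```

The hard-margin SVM searches over all $\bold w$ for the one that makes this worst-case margin as large as possible.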

The direct optimization of the above objective is complex. Here is the derivation of the equivalent, simplified optimization problem:

$$
\begin{aligned}
\max_{\bold w}\bigg[\frac{1}{\|\bold w\|}\min_{i}\big(y_i\bold w^T\phi(\bold x_i)\big)\bigg]
&=\max_{\bold w}\frac{1}{\|\bold w\|}\bigg[\min_{i}\big(y_i\bold w^T\phi(\bold x_i)\big)\bigg]\\
&=\max_{\bold w,\gamma}\frac{\gamma}{\|\bold w\|} &&\text{s.t. } y_i\bold w^T\phi(\bold x_i)\ge\gamma \quad \forall i\\
&=\max_{\bold w,\gamma}\frac{1}{\|\bold w/\gamma\|} &&\text{s.t. } y_i(\bold w/\gamma)^T\phi(\bold x_i)\ge 1 \quad \forall i\\
&=\max_{\bold w}\frac{1}{\|\bold w\|} &&\text{s.t. } y_i\bold w^T\phi(\bold x_i)\ge 1 \quad \forall i\\
&=\min_{\bold w}\|\bold w\| &&\text{s.t. } y_i\bold w^T\phi(\bold x_i)\ge 1 \quad \forall i
\end{aligned}
$$

Here $\gamma$ is the functional margin of the worst-classified point. In the third step we divide $\bold w$ by $\gamma$, which does not change the hyperplane, and in the fourth step we simply rename $\bold w/\gamma$ back to $\bold w$. Finally, maximizing $1/\|\bold w\|$ is the same as minimizing $\|\bold w\|$, which yields the constrained problem in the last line.
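The final problem is convex, so it can be handed to cvxpy directly. The following is a minimal sketch rather than the lesson's reference implementation: it assumes a small synthetic linearly separable dataset, absorbs the bias into $\bold w$ via a constant feature, solves $\min_{\bold w}\|\bold w\|$ subject to $y_i\bold w^T\phi(\bold x_i)\ge 1$, and reads off the support vectors as the points whose constraints are active.

```python
import numpy as np
import cvxpy as cp

# Synthetic linearly separable dataset (assumed for illustration); labels in {-1, +1}.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal([2.0, 2.0], 0.5, size=(20, 2)),
               rng.normal([-2.0, -2.0], 0.5, size=(20, 2))])
y = np.hstack([np.ones(20), -np.ones(20)])

# Assumed feature map phi(x) = [x, 1], so the bias is the last component of w.
Phi = np.hstack([X, np.ones((X.shape[0], 1))])

# Hard-margin SVM: minimize ||w|| subject to y_i * w^T phi(x_i) >= 1 for all i.
w = cp.Variable(Phi.shape[1])
constraints = [cp.multiply(y, Phi @ w) >= 1]
problem = cp.Problem(cp.Minimize(cp.norm(w, 2)), constraints)
problem.solve()

w_opt = w.value
margins = y * (Phi @ w_opt)
# Support vectors lie exactly on the margin: their constraints hold with equality.
support_idx = np.where(np.isclose(margins, 1.0, atol=1e-3))[0]
print("optimal w:", w_opt)
print("support vector indices:", support_idx)
print("geometric margin in input space:", 1.0 / np.linalg.norm(w_opt[:2]))
```

In practice the objective is often written as the equivalent $\tfrac{1}{2}\|\bold w\|^2$, which turns the problem into a quadratic program; the minimizer is the same. Plotting the two classes together with the line $\bold w^T\phi(\bold x)=0$ and the points in `support_idx` visualizes the decision boundary and the support vectors.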