Linear discriminant analysis

Linear discriminant analysis (LDA) is a statistical technique for dimensionality reduction. We use this technique if the features are continuous and the target value is categorical.

LDA aims to maximize the distance between the means of different classes while minimizing the variance within each class.

Example

Suppose we have a dataset with two classes: Class A and Class B with each class having multiple samples. Linear discriminant analysis (LDA) aims to find a linear discriminant or unit vector to project our data to maximize the gap between different classes while reducing the variation within each class. This helps us better distinguish between the classes and make more accurate predictions.

Note: Learn about the implementation of LDA in python.

Conclusion

By projecting the data onto the LDA axis, we can achieve better separability and discrimination between classes, making it easier to classify new data points accurately. However, it's important to note that LDA assumes the data follows a normal distribution and that the classes have identical covariance matrices. In cases where these assumptions are violated, the performance of LDA may not be optimal.

Note: Read about principle component analysis.

Linear discriminant analysis

Example

Procedure

Conclusion