Trusted answers to developer questions

What are Naive Bayes classifiers?

Get Started With Machine Learning

Learn the fundamentals of Machine Learning with this free course. Future-proof your career by adding ML skills to your toolkit — or prepare to land a job in AI or Data Science.

Naive Bayes classifiers are based on Bayes’ theorem and assume that the occurrence or absence of a feature does not influence the presence or absence of some other feature.

Types

Gaussian Naive Bayes classifier: used when features are not discreet.
Multinomial Naive Bayes Classifier: used when features follow a multinomial distribution.
Bernoulli Naive Bayes classifier: used when features are of the boolean type.

Derivation

Let’s take a look at the Mathematics behind Naive Bayes classifiers.

The equation for Bayes theorem is:

$P(class|X) = P(X|class)P(class)/P(X)$

A class variable is something that the classifier is trying to classify. For instance, when trying to classify an email as spam or not, “is spam” is the class variable.

In the equation above, $class$ is the class variable and $X$ is the set of features. $X = (x_1, x_2, ... x_n)$

The above formula can be rewritten as:

$P(class|x_1 ... x_n) = P(x_1|class)... P(x_n|class)P(class)/P(x_1)...P(x_n)$

Notice that for all entries in the given dataset, the denominator will not change. Hence, the denominator can be ignored.

$P(class|x_1 ... x_n) \propto P(x_1|class)... P(x_n|class)P(class)$

For all outcomes of the class variable, the class variable with the maximum probability needs to be found using:

$class = argmax(P(x_1|class)... P(x_n|class)P(class))$

Note: Different Naive Bayes classifiers make different assumptions regarding the distribution of $P(x_i | class)$ .

Applications

Some applications that use Naive Bayes classifiers are:

Spam Filtering
Text Analysis
Recommendation Systems

RELATED TAGS

classification

algorithms

machine learning

Learn in-demand tech skills in half the time

PRODUCTS

Mock Interview

New

Courses

Skill Paths

Projects

Assessments