Apriori Algorithm and Association Rules

Association Rule Mining is a technique for discovering rules between variables in large databases. The Apriori algorithm finds the frequent itemsets from which association rules are generated. You’ll learn about these concepts here.

We'll cover the following...

Association Rule Mining

Metrics for Evaluating Association Rules

Support
Confidence
Lift

Application of Association Rule Mining

Algorithms for mining frequent patterns

Apriori Algorithm
FP-growth Algorithm

Association Rule Mining

Association Rule Mining helps us find rules and relationships between items in a dataset. It works for both relational and transactional databases, and it can also be used to find features that are correlated with each other. An association rule has two parts: an antecedent and a consequent. The antecedent is an item or itemset found in the data, and the consequent is an item or itemset that tends to occur together with the antecedent. One example of an association rule is:

$\{Antecedent\} \rightarrow \{Consequent\}$
$\{Diaper\} \rightarrow \{Beer\}$
$X \rightarrow Y$

$X$ and $Y$ are called the antecedent and the consequent, respectively. The rule can be read as: people who buy diapers are also likely to buy beer. Here, “Diaper” and “Beer” are the items. The rule is derived from a dataset of transactions, and companies can use rules like this to make better-informed decisions and increase revenue.

Metrics for Evaluating Association Rules

Several metrics are used to evaluate how interesting an association rule is. All of them are computed from the dataset the rule was derived from. Consider the following transactional table.

Transaction ID	Items
1	Bread, Milk
2	Bread, Diaper, Beer, Eggs
3	Milk, Diaper, Beer, Coke
4	Bread, Milk, Diaper, Beer
5	Bread, Milk, Diaper, Coke

Support

Support tells us how frequent or popular an itemset is, measured as the proportion of transactions in which the itemset appears. It is a value between 0 and 1, and values closer to 1 mean the itemset occurs more frequently in the dataset. We call an itemset a frequent itemset if its support is larger than a specified minimum-support threshold. In the above table, we have:

$Support\{Beer\} = \frac{3}{5}$

There are five transactions in total, and beer appears in three of them (transactions 2, 3, and 4).

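$Support\{Milk, Coke\} = \frac{2}{5}$

Only transactions 3 and 5 contain both milk and coke, so the itemset appears in two of the five transactions.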
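The support values above can be reproduced with a few lines of Python. This is a minimal sketch, not a library implementation: the names `transactions`, `support`, `is_frequent`, and the threshold `MIN_SUPPORT = 0.5` are assumptions chosen for illustration, and transaction 2 is read as Bread, Diaper, Beer, Eggs.

```python
# Transactions from the table above, one set of items per transaction ID.
# Assumption: transaction 2 is read as Bread, Diaper, Beer, Eggs.
transactions = [
    {"Bread", "Milk"},
    {"Bread", "Diaper", "Beer", "Eggs"},
    {"Milk", "Diaper", "Beer", "Coke"},
    {"Bread", "Milk", "Diaper", "Beer"},
    {"Bread", "Milk", "Diaper", "Coke"},
]

# Hypothetical minimum-support threshold, chosen only for this example.
MIN_SUPPORT = 0.5


def support(itemset, transactions):
    """Return the fraction of transactions containing every item in itemset."""
    itemset = set(itemset)
    # A transaction counts if the itemset is a subset of it.
    matches = sum(1 for t in transactions if itemset <= t)
    return matches / len(transactions)


def is_frequent(itemset, transactions, min_support=MIN_SUPPORT):
    """An itemset is frequent if its support is larger than the threshold."""
    return support(itemset, transactions) > min_support


print(support({"Beer"}, transactions))              # 0.6
print(support({"Milk", "Coke"}, transactions))      # 0.4
print(is_frequent({"Beer"}, transactions))          # True
print(is_frequent({"Milk", "Coke"}, transactions))  # False
```

Storing each transaction as a set keeps the membership check to a single subset comparison. With the assumed threshold of 0.5, {Beer} counts as a frequent itemset while {Milk, Coke} does not; a different threshold would change that outcome.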
