What is the perceptron learning rule?

Within the domain of machine learning, a perceptron is an algorithm used for the supervised learning of binary classifiers.

Supervised learning

Supervised learning refers to the training of a model using a labeled dataset. A labeled dataset has labeled input and output parameters.

Error and adjustments

Before we can use a perceptron to predict output, it is trained using labeled data. For each input in the training set, we compute the output. If the observed output does not match the expected output, we calculate the error.

Initially, we usually set the weights to randomly selected numbers.

We can then use the error to tweak the weights in favor of the expected output. We repeat this process until the perceptron gives a high degree of accuracy in its output.

How much we adjust our weights is controlled by the learning rate of the perceptron.

The following is our activation function:

Y = 1 if wx+b > 0
and
Y = 0 if wx+b ≤ 0

Initially, we set the weight ( $w_1, w_2$ ) as 1. The bias is set to -1.

First row

Let’s take the first row. After applying the net input function on the first row, we get the following:

$x_1(1) + x_2(1) - 1 = -1$

Applying the activation function for -1:

Y = 0

The observed output (0) matches the expected output (0), so there is no need to tweak the weights.

Second row

Let’s take the second row. After applying the net input function on the second row, we get the following:

$x_1(1) + x_2(1) - 1 = 0$

Applying the activation function for 0:

Y = 0

The observed output (0) does not match the expected output (1). We need to tweak the weights.

Consider setting $w_2$ to 2. Now, when we apply the apply the net input function, we get the following:

$x_1(1) + x_2(2) - 1 = 1$

Apply the activation function for 1:

Y = 1

The outputs match.

Third row

Now, we take the third row. After applying the net input function on the third row, we get the following:

$x_1(1) + x_2(2)-1 =0$

Applying the activation function for 0:

Y = 0

The observed output (0) does not match the expected output (1). We need to tweak the weights.

Since this case is the symmetric case for the second, we simply change $w_1$ to 2 as well.

Now, applying the net input function:

$x_1(2)+x_2(2)-1=1$

Apply the activation function for 1:

Y = 1

The outputs match.

Fourth row

Finally, take the fourth row. After applying the net input function on the fourth row, we get the following:

$x_1(2) + x_2(2) - 1 = 3$

Applying the activation function:

Y = 1

The observed output (1) matches the expected output (1), so there is no need to tweak the weights.

Thus, we conclude that our perceptron with weights set to 2 and bias set to -1 works perfectly for the logical OR gate with two inputs.

What is the perceptron learning rule?

Supervised learning

Binary classifiers

Perceptron learning rule

Activation function

Error and adjustments

Bias

Example

First row

Second row

Third row

Fourth row

x₁	x₂	Y
0	0	0
0	1	1
1	0	1
1	1	1