CNN for Classification

Explore how to build a convolutional neural network to classify handwritten digit images from the MNIST dataset. Understand input and output tensor shapes, convolutional layers, pooling, and how to assemble a CNN architecture. Learn the role of ReLU activation, dropout, and converting logits to probabilities. This lesson provides foundational knowledge to implement and train CNNs for image classification tasks.

We'll cover the following...

Problem statement
The input and output tensor shapes
The tensor flow graph
Assembling building blocks
Passing a tensor through the CNN

We saw in the previous lesson that a multilayer perceptron can be trained to classify tabular data. An image differs from tabular data in that its information is unstructured: we cannot go to a predefined pixel to extract a useful feature for the classification. Objects in an image dataset can appear anywhere in the frame and in various poses. For this reason, it is generally not effective to treat an image as a vector of length $H\times W$ and process it with a multilayer perceptron.
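To make this concrete, here is a minimal sketch in plain Python. The 6x6 image size, the 2x2 blob, and the helper names `make_image` and `flatten` are assumptions chosen for illustration; they are not part of the lesson's code. The sketch places the same blob at two positions and compares which entries of the flattened vectors are active.

```python
# Toy illustration: the same 2x2 blob placed at two positions in a 6x6 image.
# H, W, make_image, and flatten are hypothetical names used only for this sketch.
H, W = 6, 6

def make_image(top, left):
    """Return an H x W image (list of rows) with a 2x2 blob of ones at (top, left)."""
    img = [[0] * W for _ in range(H)]
    for r in range(top, top + 2):
        for c in range(left, left + 2):
            img[r][c] = 1
    return img

def flatten(img):
    """Row-major flattening into a vector of length H * W."""
    return [pixel for row in img for pixel in row]

a = flatten(make_image(0, 0))  # blob in the top-left corner
b = flatten(make_image(3, 3))  # same blob, shifted down and to the right

# Indices of the non-zero entries in each flattened vector.
active_a = {i for i, v in enumerate(a) if v}
active_b = {i for i, v in enumerate(b) if v}

print(sorted(active_a))     # [0, 1, 6, 7]
print(sorted(active_b))     # [21, 22, 27, 28]
print(active_a & active_b)  # set()
```

The two vectors share no active index, even though both images show the same shape. A model that assigns one weight per vector entry would have to learn the blob separately at every position.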

A better approach is to extract high-level features through a composition of convolutional layers and spatial pooling. Once the spatial resolution is sufficiently low, we can flatten the resulting feature maps into a vector. This vector can then be processed by a multilayer perceptron.
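The reduction in spatial resolution can be traced with simple arithmetic. The sketch below assumes a hypothetical stack of two 3x3 convolutions (padding 1) and two 2x2 max-pooling layers, with 32 channels in the last feature map. These values are illustrative assumptions, not the architecture assembled later in this lesson. It uses the standard output-size formula for a convolution or pooling window without dilation: `(size + 2 * padding - kernel) // stride + 1`.

```python
def out_size(size, kernel, stride=1, padding=0):
    """Spatial output size of a convolution or pooling layer (no dilation)."""
    return (size + 2 * padding - kernel) // stride + 1

# Hypothetical layer stack, chosen only to illustrate the shape arithmetic.
layers = [
    ("conv 3x3, padding 1", dict(kernel=3, stride=1, padding=1)),
    ("max pool 2x2",        dict(kernel=2, stride=2, padding=0)),
    ("conv 3x3, padding 1", dict(kernel=3, stride=1, padding=1)),
    ("max pool 2x2",        dict(kernel=2, stride=2, padding=0)),
]

size = 28            # a 28x28 input image
last_channels = 32   # assumed number of channels after the last convolution

sizes = [size]
for name, params in layers:
    size = out_size(size, **params)
    sizes.append(size)
    print(f"{name:<22} -> {size}x{size}")

# Length of the vector handed to the multilayer perceptron after flattening.
flat_length = last_channels * size * size
print("flattened length:", flat_length)  # flattened length: 1568
```

With these assumed settings, the resolution goes from 28 to 14 to 7, and the flattened vector has 32 * 7 * 7 = 1568 entries.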

Problem statement

In this lesson, we build a CNN from the building blocks we studied in the previous lesson. The goal is to classify monochrome images of handwritten digits from the MNIST dataset. Each image is 28×28 pixels, and the classes are the 10 digits, from zero to nine.

The input and output tensor shapes