Homographies

Learn to register images of planar objects with affine and perspective transformations.

We'll cover the following...

Projection matrix of a pinhole camera
- Perspective projection
- Affine projection
Locating corners
Affine transformation
Perspective transformation

Suppose a client wants us to verify that the book title, “WE IMAGINE WE DRAW OBJECTS”, and the name of the educational series, “BARRON’S”, at the bottom of the book are printed correctly. Unfortunately, the camera can capture the book from different points of view, and these two images represent the most extreme cases. Our task is to register these images so that the features appear at approximately the same place and in the same pose.

Imagine a virtual camera looking perpendicularly at each planar surface, with the pixel axes aligned with the book edges. We search for a transformation that warps each original image into the image this virtual camera would capture. If we succeed, the features to inspect will always land in roughly the same image area, aligned with the image axes.

This transformation (a projection from the object plane to the image plane of a virtual camera) is called a homography. OpenCV offers two functions for warping an image in this way: cv2.warpAffine() and cv2.warpPerspective(). We’ll apply both transformations to our images to understand when to use one or the other, but first, we should look at the geometry of image projection.

Projection matrix of a pinhole camera

A simple model for a camera is the pinhole camera model. Imagine that a camera consists of a light sensor area (the image plane) inside a closed box, with only a very small hole (the pinhole) allowing rays of light to reach the image plane. The distance between the image plane and the pinhole is $f$, the focal distance.

Note: In a real camera, a lens plays the role of the pinhole. The lens allows more light to reach the sensor, at the cost of a shorter depth of field.
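To make the role of $f$ concrete, the sketch below projects a 3D point through the pinhole using similar triangles: a point at depth $Z$ lands at $f X / Z$ and $f Y / Z$ on the image plane. It assumes the point is expressed in camera coordinates with the $Z$ axis pointing out of the pinhole, and it ignores the image inversion by placing the image plane in front of the pinhole. The function name and the numeric values are hypothetical, chosen only for illustration.

```python
def project_pinhole(point_3d, focal_distance):
    """Project a 3D point (X, Y, Z), in camera coordinates, onto the image plane.

    Assumes an ideal pinhole with the image plane placed in front of it,
    so the projected image is not inverted.
    """
    x, y, z = point_3d
    if z <= 0:
        raise ValueError("the point must be in front of the pinhole (Z > 0)")
    return (focal_distance * x / z, focal_distance * y / z)


# Hypothetical values: the same (X, Y) offset seen at two different depths, in meters.
near = project_pinhole((0.1, 0.05, 1.0), focal_distance=0.01)
far = project_pinhole((0.1, 0.05, 2.0), focal_distance=0.01)
print("near:", near)
print("far: ", far)
```

Doubling the depth halves the projected coordinates, which is why distant objects appear smaller in the image.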
