What is affine transformation?

In this era, data is of great importance. Many companies spend their fortune on collecting suitable data. As only a handful of open source datasets are available. This can be targeted by artificially creating new data. When it comes to image data, we can perform different transformations. This process of artificially generating new data is called data augmentation.

The combination of linear transformations is called an affine transformation. By linear transformation, we mean that lines will be mapped to new lines preserving their parallelism, and pixels will be mapped to new pixels without disrupting the distance ratio. Affine transformation is also used in satellite image processing, data augmentation for images, and so on.

These transformations are performed by different matrices multiplication with a matrix $M$ . Different transformations require different kernel matrices that give respective transformations when multiplied by the image matrix. The affine transformation consists of the following transformations:

Scaling
Translation
Shear
Rotation

Note: A combination of these transformations is also an affine transformation.

Mathematical background

As mentioned earlier, matrix multiplication and addition play a big role in affine transformation. We first take a point $X$ with $x \text{ and } y$ from the image and represent it as a vector but with a three dimension set to 1. It is important to include this third dimension because otherwise, the transformation would not be linear.

What is affine transformation?

Mathematical background

Scaling

Code implementation

Translation

Code implementation

Shear

Code implementation

Rotation

Code implementation