VAEs in Action

Learn to use VAEs for image generation: data preprocessing, architecture design, training, and image synthesis through encoding and sampling.

We'll cover the following...

Overview
Loading and preprocessing the data
- Unique training approach in VAEs
Defining the architecture
Training
Generating new images

import matplotlib.pyplot as plt
import numpy as np
import torch
import torch.nn as nn
import torch.nn.functional as nnF
import torch.optim as optim
import torchvision

# Load the MNIST training set (downloaded to ./data on first run)
mnist_train = torchvision.datasets.MNIST(root='./data', train=True, download=True)
x_train = mnist_train.data       # uint8 tensor of shape (60000, 28, 28)
y_train = mnist_train.targets    # integer labels 0-9

# Scale pixel values from [0, 255] to [0, 1]
x_train = x_train.float() / 255.0

# Visualize one random example of each digit in a 2 x 5 grid
num_rows = 2
num_columns = 5
fig, axes = plt.subplots(num_rows, num_columns, figsize=(4, 2))
labels_np = y_train.numpy()
for category in range(10):
    # Pick a random image whose label matches the current digit
    category_indices = np.where(labels_np == category)[0]
    random_index = np.random.choice(category_indices)
    image_np = x_train[random_index].numpy()
    row = category // num_columns
    col = category % num_columns
    ax = axes[row, col]
    ax.imshow(image_np, cmap='gray')
    ax.axis('off')
plt.tight_layout()
plt.show()

Defining the architecture

The autoencoder’s architecture has three main components.

Encoder

The encoder compresses the input MNIST image into a lower-dimensional latent space. It consists of three linear layers: self.input, self.hidden_mean, and self.hidden_std. The encoder passes the input image through these layers and outputs the mean (mean) and standard deviation (std) vectors of the latent Gaussian distribution. These vectors represent the parameters of the ...
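To make the description concrete, here is a minimal sketch of what such an encoder might look like. Only the layer names self.input, self.hidden_mean, and self.hidden_std and the outputs mean and std come from the text above. Everything else is an assumption made for illustration: the layer sizes (784, 256, and 2), the ReLU activation, flattening the image inside forward, and using softplus to keep the standard deviation positive. The lesson's actual encoder may differ on any of these points.

```python
import torch
import torch.nn as nn
import torch.nn.functional as nnF


class Encoder(nn.Module):
    """Illustrative VAE encoder; sizes and activations are assumptions."""

    def __init__(self, input_dim=784, hidden_dim=256, latent_dim=2):
        super().__init__()
        # Shared layer that maps the flattened image to a hidden representation
        self.input = nn.Linear(input_dim, hidden_dim)
        # Two heads: one for the mean, one for the standard deviation
        self.hidden_mean = nn.Linear(hidden_dim, latent_dim)
        self.hidden_std = nn.Linear(hidden_dim, latent_dim)

    def forward(self, x):
        # Flatten (batch, 28, 28) images into (batch, 784) vectors
        x = x.reshape(x.size(0), -1)
        hidden = nnF.relu(self.input(x))
        mean = self.hidden_mean(hidden)
        # Assumption: softplus keeps the standard deviation positive
        std = nnF.softplus(self.hidden_std(hidden))
        return mean, std


# Usage: encode a batch of 8 random "images" with pixel values in [0, 1]
encoder = Encoder()
batch = torch.rand(8, 28, 28)
mean, std = encoder(batch)
print(mean.shape, std.shape)
```

Each image in the batch is mapped to one mean vector and one standard deviation vector, so both outputs have shape (batch size, latent dimension). Splitting the network into two heads after a shared layer is one common way to produce both parameters from the same hidden features.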
