What are variational autoencoders (VAEs)?


Introduction

Variational autoencoders (VAEs) are among the most commonly used models for content generation. In simple terms, a VAE is an autoencoder with regularization.

When we compress data, similar data points are placed close together in a space of reduced dimensions; this space is called the latent space. VAEs dive into the probabilistic domain: instead of getting a single output from the encoder, we get a probability distribution for each latent attribute in the latent space. This distribution is used in combination with the decoder to generate new content.

In the case of VAEs, the decoder is also known as the generative network, while the encoder is known as the recognition network or the inference network.

Need for VAEs

Autoencoders are suitable for encoding and decoding, but they don't perform well when used for content generation. To generate content, we take a point in the latent space and use the decoder to turn it into an output. A single latent vector is used, since it represents one encoding, and the decoder then uses it to recreate an actual input.

An autoencoder model

This model is prone to overfitting because it may learn the identity function, that is, it may simply map each feature to itself:

Content generation using the trained decoder of an autoencoder
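To make the limitation concrete, here is a minimal sketch of a plain autoencoder reduced to two linear maps. The weight names (`W_enc`, `W_dec`) and the toy dimensions are hypothetical, chosen only for illustration:

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical "trained" autoencoder, reduced to two linear maps.
W_enc = rng.standard_normal((4, 2))   # encoder: 4 input features -> 2 latent attributes
W_dec = rng.standard_normal((2, 4))   # decoder: latent code -> reconstructed input

x = rng.standard_normal((1, 4))
z = x @ W_enc        # deterministic encoding: a single point, not a distribution
x_hat = z @ W_dec    # reconstruction of the input

# "Content generation": pick an arbitrary latent point and decode it.
# Nothing guarantees this point lies in a region the decoder was trained on,
# which is why plain autoencoders tend to generate poorly.
z_new = np.array([[0.5, -1.0]])
x_new = z_new @ W_dec
print(x_new.shape)  # (1, 4)
```

Because the encoding is a single deterministic point, nearby latent points may decode to meaningless outputs; this is the gap that VAEs close.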

Variational autoencoders (VAEs)

In VAEs, the recognition network (the encoder) outputs a probability distribution: it produces the mean and variance parameters for each latent attribute, which together define the required distribution. The decoder then samples from this distribution and reconstructs the original input from these samples.
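The sampling step above is usually implemented with the reparameterization trick, $z = \mu + \sigma \epsilon$ with $\epsilon \sim N(0, I)$, which keeps the step differentiable. Below is a minimal NumPy sketch; the linear encoder and the weight names (`W_mu`, `W_logvar`) are assumptions for illustration, not a real trained model:

```python
import numpy as np

rng = np.random.default_rng(0)

def encode(x, W_mu, W_logvar):
    # Hypothetical linear encoder: maps an input to the mean and
    # log-variance of a Gaussian over each latent attribute.
    mu = x @ W_mu
    logvar = x @ W_logvar
    return mu, logvar

def sample_latent(mu, logvar):
    # Reparameterization trick: z = mu + sigma * eps, eps ~ N(0, I),
    # so the sample is differentiable with respect to mu and sigma.
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * logvar) * eps

x = rng.standard_normal((1, 4))        # one toy input with 4 features
W_mu = rng.standard_normal((4, 2))     # 2 latent attributes
W_logvar = rng.standard_normal((4, 2))

mu, logvar = encode(x, W_mu, W_logvar)
z = sample_latent(mu, logvar)
print(z.shape)  # (1, 2): one sample per latent attribute
```

The decoder receives `z` and reconstructs the input, exactly as in a plain autoencoder; only the encoding step has become stochastic.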

In statistical terms, the generative model can be defined as the following:

$$p_\theta(x, z) = p_\theta(z)\, p_\theta(x \mid z)$$

Here, $x$ represents the input and $z$ represents the latent space embeddings. $p_\theta(z)$ and $p_\theta(x \mid z)$ are usually normal and exponential-family distributions, respectively. The recognition model performs approximate posterior inference (a method of approximating the uncertainty in our estimates of the parameters) and can be defined as the following:

$$q_\phi(z \mid x) \approx p_\theta(z \mid x)$$

VAEs are better than autoencoders because they can sample from the normal distribution to generate inputs that the encoder has never seen. This gives us a smooth latent space, which we can leverage to generate entirely new content by interpolating.

Latent space interpolation

Suppose we have two inputs, $x_1$ and $x_2$. Let $z_1 = E_{q(z|x_1)}[z]$ and $z_2 = E_{q(z|x_2)}[z]$ be the corresponding latent attributes. We can now generate new content by the following interpolation:

$$z = \lambda z_1 + (1 - \lambda) z_2$$

where $0 \le \lambda \le 1$.

Now we can decode the interpolated point $z$ with the decoder to obtain the new content. This technique is called latent space interpolation.
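As a sketch, the convex combination $z = \lambda z_1 + (1 - \lambda) z_2$ is a one-liner; the latent values below are made up for illustration, and each interpolated point would be passed to the decoder:

```python
import numpy as np

def interpolate(z1, z2, lam):
    # Latent space interpolation: z = lam * z1 + (1 - lam) * z2, 0 <= lam <= 1.
    return lam * z1 + (1.0 - lam) * z2

z1 = np.array([0.0, 2.0])  # hypothetical mean encoding of x1
z2 = np.array([4.0, 0.0])  # hypothetical mean encoding of x2

# Sweeping lam from 0 to 1 moves smoothly from z2 to z1;
# decoding each point yields a gradual blend of the two inputs.
path = [interpolate(z1, z2, lam) for lam in (0.0, 0.5, 1.0)]
print(path[1])  # midpoint: [2. 1.]
```

Because the VAE's latent space is smooth, every point along this path decodes to a plausible output, which is what makes the blend gradual rather than noisy.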

The workflow of a VAE
