Generating Photo-Realistic Images with StackGAN++
Explore how to create photo-realistic images from textual descriptions using StackGAN++. Learn the two-stage generation process, conditioning augmentation, and how the multi-branch structure improves image quality. Understand the key differences and advantages of StackGAN++ over the original StackGAN model, including multi-scale synthesis, unconditional loss, and color-consistency.
We'll cover the following...
The generation of images from description text can be considered a conditional GAN (CGAN) process in which the embedding vector of the description sentence is used as the additional label information. We need to figure out how to generate large images with CGAN. It’s also possible to stack two CGANs together so that we can get high-quality images. This is exactly what
High-resolution text-to-image synthesis with StackGAN
The embedding vector,