Text Generation and the Magic of LSTMs
Explore the fundamentals of text generation using recurrent neural networks, focusing on LSTMs that capture context in language data. Learn to prepare training datasets, build character-level language models with TensorFlow and Keras, and train these models to predict sequences effectively.
In this section, we'll leverage this understanding of text representation to work our way toward building text generation models.
So far, we have built models using feedforward networks consisting of different kinds and combinations of layers. These networks process one training example at a time, and each example is independent of the others. We say that such samples are independent and identically distributed, or IID. Language, or text, is a bit different.
Words change their meaning based on the context in which they are used. Consequently, if we want to develop and train a language generation model, we have to ensure that the model understands the context of its input.
Recurrent neural networks (RNNs) are a class of neural networks that feed their previous outputs back in as inputs, alongside memory or hidden units. This awareness of previous inputs helps capture context and gives us the ability to handle variable-length input sequences (sentences are hardly ever the same length). A typical RNN is depicted in the following diagram, in both its actual and unrolled forms:
As shown in the figure above, at each time step $t$ the network combines the current input $x_t$ with the hidden state $h_{t-1}$ carried over from the previous step to produce a new hidden state $h_t$ (and, optionally, an output). This feedback loop is what allows the network to carry context forward through a sequence.
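To make the recurrence concrete, here is a minimal NumPy sketch of a simple RNN cell stepping through a toy sequence. The tanh activation and the weight shapes follow the standard formulation $h_t = \tanh(W_{xh} x_t + W_{hh} h_{t-1} + b_h)$; the dimensions and random data are purely illustrative, not tied to any dataset used later.

```python
import numpy as np

# Illustrative dimensions (assumed values, not from the lesson's dataset)
input_dim, hidden_dim, seq_len = 8, 16, 5

rng = np.random.default_rng(0)
W_xh = rng.normal(scale=0.1, size=(hidden_dim, input_dim))   # input-to-hidden weights
W_hh = rng.normal(scale=0.1, size=(hidden_dim, hidden_dim))  # hidden-to-hidden weights
b_h = np.zeros(hidden_dim)                                   # hidden bias

x_sequence = rng.normal(size=(seq_len, input_dim))  # a toy input sequence
h_t = np.zeros(hidden_dim)                          # initial hidden state

for t, x_t in enumerate(x_sequence):
    # The hidden state from the previous step is fed back in, so h_t
    # summarizes everything the network has seen so far in the sequence.
    h_t = np.tanh(W_xh @ x_t + W_hh @ h_t + b_h)
    print(f"step {t}: hidden state norm = {np.linalg.norm(h_t):.3f}")
```

In Keras, this same recurrence is what a layer such as `tf.keras.layers.SimpleRNN` (or, with additional gating, `LSTM`) performs internally over each input sequence.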