Comparing LSTMs to LSTMs with Peephole Connections and GRUs

Now, we’ll compare LSTMs to LSTMs with peepholes and GRUs on the text generation task. This will help us evaluate how well these different models perform in terms of perplexity. Remember that we prefer perplexity over accuracy because accuracy assumes there’s only one correct token given a previous input sequence. However, as we have learned, language is complex, and there can be many different correct ways to continue a given sequence of text.
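As a reminder of how perplexity relates to the loss we train with, here’s a minimal sketch that computes perplexity as the exponent of the mean per-token cross-entropy. The loss values below are hypothetical and only for illustration; they aren’t results from the models in this lesson.

```python
import numpy as np

def perplexity(per_token_losses):
    """Perplexity is exp of the mean per-token cross-entropy (natural log)."""
    return float(np.exp(np.mean(per_token_losses)))

# Hypothetical per-token cross-entropy losses on the same validation text.
lstm_losses = [4.1, 3.8, 4.0, 3.9]
gru_losses = [4.3, 4.0, 4.2, 4.1]

print("LSTM perplexity:", perplexity(lstm_losses))   # lower is better
print("GRU perplexity:", perplexity(gru_losses))
```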

Standard LSTM

First, we’ll reiterate the components of a standard LSTM. We won’t repeat the code because it’s identical to what we discussed previously. Then, we’ll look at some text generated by an LSTM.

Here, we’ll revisit what a standard LSTM looks like. As we already mentioned, an LSTM consists of the following:

  • Input gate: This decides how much of the current input is written to the cell state.

  • Forget gate: This decides how much of the previous cell state is written to the current cell state.

  • Output gate: This decides how much information from the cell state is exposed as the external hidden state (the output).

In the figure below, we illustrate how these gates, the input, the cell state, and the external hidden state are connected:
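To complement the figure, here’s a minimal NumPy sketch of a single step of a standard LSTM cell. The weight names (W_*, U_*, b_*) and the params dictionary are illustrative placeholders, not the course’s actual implementation.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x_t, h_prev, c_prev, params):
    """One step of a standard LSTM cell (no peephole connections).

    x_t: current input vector; h_prev, c_prev: previous hidden and cell states.
    params: illustrative dict of input weights W_*, recurrent weights U_*, biases b_*.
    """
    # Input gate: how much of the candidate input is written to the cell state.
    i_t = sigmoid(params["W_i"] @ x_t + params["U_i"] @ h_prev + params["b_i"])
    # Forget gate: how much of the previous cell state is carried forward.
    f_t = sigmoid(params["W_f"] @ x_t + params["U_f"] @ h_prev + params["b_f"])
    # Output gate: how much of the cell state is exposed as the external hidden state.
    o_t = sigmoid(params["W_o"] @ x_t + params["U_o"] @ h_prev + params["b_o"])
    # Candidate cell state computed from the current input and previous hidden state.
    c_tilde = np.tanh(params["W_c"] @ x_t + params["U_c"] @ h_prev + params["b_c"])
    # New cell state and external hidden state.
    c_t = f_t * c_prev + i_t * c_tilde
    h_t = o_t * np.tanh(c_t)
    return h_t, c_t
```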
