Natural Language Processing with TensorFlow/

...

Improving LSTMs: Generating Text with Words Instead of N-grams

Learn how using words instead of n-grams can improve LSTMs.

We'll cover the following...

The curse of dimensionality
Word2vec to the rescue
Generating text with Word2vec

Here, we’ll discuss ways to improve LSTMs. We have so far used bigrams as our basic unit of text. But we would get better results by incorporating words instead of bigrams. This is because using words reduces the overhead of the model by alleviating the need to learn to form words from bigrams. We’ll discuss how we can employ word vectors in the code to generate better-quality text compared to using bigrams.

The curse of dimensionality

One major limitation stopping us from using words instead of n-grams as the input to our LSTM is that this will drastically increase the number of parameters in our model. Let’s try to understand this through an example. Consider that we have an ...

Introduction to Natural Language Processing

Understanding TensorFlow 2

Word2vec: Learning Word Embeddings

Advanced Word Vector Algorithms

Sentence Classification with Convolutional Neural Networks

Recurrent Neural Networks

Understanding Long Short-Term Memory Networks

Applications of LSTM: Generating Text

Sequence-to-Sequence Learning: Neural Machine Translation

Transformers

Sarcasm Classification Using BERT

Image Captioning with Transformers

Caption Generation Using PyTorch

Final Remarks

Appendix: Mathematical Foundations and Advanced TensorFlow

Improving LSTMs: Generating Text with Words Instead of N-grams

The curse of dimensionality