Decoding Strategies
Explore key decoding strategies used in text generation with AI models, including greedy decoding, beam search, and sampling. Understand how these methods affect the quality and variability of generated text, and learn how to implement them with controls like temperature to balance creativity and coherence.
Now that we have a trained model, the next step is to feed it some context words and generate the next word as output. This generation step is formally known as the decoding step. It is termed “decoding” because the model outputs a vector that must be processed to obtain the actual word. There are a few different decoding techniques; let's briefly discuss the popular ones: greedy decoding, beam search, and sampling.
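To make the "vector to word" processing concrete, here is a minimal sketch of how a model's output vector (raw scores, or logits) is typically converted into a probability distribution with a softmax and then mapped to a word. The vocabulary and the logit values are hypothetical, standing in for a real trained model:

```python
import math

VOCAB = ["the", "cat", "sat"]  # hypothetical 3-word vocabulary

def softmax(logits):
    # Convert raw scores into a probability distribution.
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.1]  # hypothetical model output vector
probs = softmax(logits)
word = VOCAB[probs.index(max(probs))]
print(word)  # the
```

Every decoding strategy below starts from a distribution like `probs`; the strategies differ only in how they choose tokens from it.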
Greedy decoding
This is the simplest and fastest decoding strategy. As the name suggests, greedy decoding picks the highest-probability term at every prediction step.
While this is fast and efficient, being greedy does create a few issues when generating text. By always committing to the single highest-probability output, the model can produce inconsistent or incoherent text. In the case of character-level language models, it may even produce outputs that are not dictionary words. Greedy decoding also limits the variance of the outputs, which can lead to repetitive content.
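The strategy can be sketched in a few lines. The toy scoring function below is a hypothetical stand-in for a trained model (each token deterministically favors one successor); only `greedy_decode` illustrates the strategy itself:

```python
VOCAB = ["<s>", "the", "cat", "sat", "mat", "."]

def toy_next_token_scores(tokens):
    # Hypothetical stand-in for a trained model: each token
    # deterministically favors one successor.
    prefs = {0: 1, 1: 2, 2: 3, 3: 4, 4: 5, 5: 5}
    scores = [0.0] * len(VOCAB)
    scores[prefs[tokens[-1]]] = 1.0
    return scores

def greedy_decode(score_fn, context, steps):
    """Append the single highest-scoring token at every step."""
    tokens = list(context)
    for _ in range(steps):
        scores = score_fn(tokens)
        tokens.append(max(range(len(scores)), key=scores.__getitem__))
    return tokens

print([VOCAB[t] for t in greedy_decode(toy_next_token_scores, [0], 4)])
# ['<s>', 'the', 'cat', 'sat', 'mat']
```

Note that `greedy_decode` never reconsiders a choice: once a token is appended, every later step is conditioned on it, which is exactly why a single locally optimal pick can lock the model into a globally poor sequence.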
Beam search
Beam search is a widely used alternative to greedy decoding. Instead of picking only the highest-probability term at each step, this strategy keeps track of the top k candidate sequences, where k is known as the beam width.

As shown in the figure above, the beam search strategy works by keeping track of the k most probable partial sequences at every time step: each candidate sequence is extended with the model's predicted next words, cumulative probabilities are computed, and only the k highest-scoring sequences are retained.

For example, at a given time step, the model predicts the next few candidate words with their probabilities (such as the, ...), and beam search expands every current candidate with each of them, keeping only the best-scoring sequences for the next step.
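The procedure above can be sketched as follows. The vocabulary and the fixed next-word probabilities in `toy_logprobs` are hypothetical stand-ins for a real model; the example is chosen so that beam search finds a higher-probability sequence than greedy decoding would:

```python
import math

def beam_search(logprob_fn, context, beam_width, steps):
    # Each beam entry: (token sequence, cumulative log-probability).
    beams = [(list(context), 0.0)]
    for _ in range(steps):
        candidates = []
        for tokens, score in beams:
            # Extend every current candidate with every possible next token.
            for tok, lp in enumerate(logprob_fn(tokens)):
                candidates.append((tokens + [tok], score + lp))
        # Keep only the beam_width highest-scoring sequences.
        beams = sorted(candidates, key=lambda c: c[1], reverse=True)[:beam_width]
    return beams

def toy_logprobs(tokens):
    # Hypothetical 3-token vocabulary with fixed next-token probabilities.
    table = {
        0: [math.log(0.1), math.log(0.5), math.log(0.4)],
        1: [math.log(0.3), math.log(0.4), math.log(0.3)],
        2: [math.log(0.9), math.log(0.05), math.log(0.05)],
    }
    return table[tokens[-1]]

best_tokens, best_score = beam_search(toy_logprobs, [0], beam_width=2, steps=2)[0]
print(best_tokens, round(math.exp(best_score), 2))  # [0, 2, 0] 0.36
```

Here greedy decoding would pick token 1 first (probability 0.5) and end with sequence probability 0.5 × 0.4 = 0.20, while beam search keeps the second-best token 2 alive and discovers the stronger continuation 0.4 × 0.9 = 0.36.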
...