Training the NMT
Understand the process of training neural machine translation models by preparing data, implementing custom training loops in TensorFlow, and tracking performance using the BLEU score, a key metric for evaluating translation quality in NLP.
Now that we have defined the NMT architecture and preprocessed the training data, it’s quite straightforward to train the model. Here, we’ll define and illustrate the exact process used for training:
For the model training, we’re going to define a custom training loop, because there’s a special metric we’d like to track: the BLEU score. Unfortunately, this metric is not readily available as a TensorFlow metric, so we’ll have to compute it ourselves inside the loop. But before that, there are several utility functions we need to define.
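To show where those utilities fit, here is a minimal sketch of the kind of custom training loop we have in mind. It is an illustration of the pattern rather than the chapter’s exact loop: the model, optimizer, train_ds, valid_ds, num_epochs, compute_bleu(), and generate_translations() names are hypothetical stand-ins for components defined elsewhere, and only tf.GradientTape and the optimizer mechanics are standard TensorFlow.

```python
import tensorflow as tf

# Hypothetical stand-ins, assumed to be defined elsewhere in the chapter:
#   model          -- the encoder-decoder NMT model
#   optimizer      -- e.g. tf.keras.optimizers.Adam()
#   train_ds       -- a batched tf.data.Dataset of ((enc_in, dec_in), dec_labels)
#   compute_bleu() -- a helper that scores hypotheses against references
loss_fn = tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True)

@tf.function
def train_step(enc_in, dec_in, dec_labels):
    with tf.GradientTape() as tape:
        logits = model([enc_in, dec_in], training=True)
        # Padding masking is omitted here for brevity.
        loss = loss_fn(dec_labels, logits)
    grads = tape.gradient(loss, model.trainable_variables)
    optimizer.apply_gradients(zip(grads, model.trainable_variables))
    return loss

for epoch in range(num_epochs):
    for (enc_in, dec_in), dec_labels in train_ds:
        loss = train_step(enc_in, dec_in, dec_labels)
    # BLEU is not a built-in TensorFlow metric, so we evaluate it manually
    # on the validation set after every epoch.
    bleu = compute_bleu(valid_references, generate_translations(model, valid_ds))
    print(f"Epoch {epoch + 1}: loss={float(loss):.4f}, valid BLEU={bleu:.4f}")
```

The first of these utility functions is prepare_data().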
The prepare_data() function

The prepare_data() function takes the source and target sentence pairs and generates the encoder inputs, the decoder inputs, and the decoder labels. Let’s look at the arguments:
- de_lookup_layer: The StringLookup layer of the German language.
- train_xy: A tuple containing the tokenized English sentences and the tokenized German sentences in the training set, respectively.
- valid_xy: Similar to train_xy, but for the validation data.
- test_xy: Similar to train_xy, but for the test data.
For each of the training, validation, and test datasets, this function generates the following: ...
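Although the exact outputs are elided above, a minimal sketch of how such a function can be put together is shown below. It assumes teacher forcing, that each tokenized German sentence already carries its start and end markers, and that de_lookup_layer maps German tokens to vocabulary IDs; the remaining names follow the arguments described above, and the real function’s padding and special-token handling may differ.

```python
def prepare_data(de_lookup_layer, train_xy, valid_xy, test_xy):
    """Sketch: build encoder inputs, decoder inputs, and decoder labels.

    Each *_xy is a tuple of (tokenized English sentences, tokenized German
    sentences). This only illustrates the teacher-forcing shift; the actual
    function may pad, batch, or encode the data differently.
    """
    prepared = {}
    splits = zip(("train", "valid", "test"), (train_xy, valid_xy, test_xy))
    for name, (en_sents, de_sents) in splits:
        # The encoder consumes the full English sentence.
        enc_inputs = en_sents
        # Teacher forcing: the decoder input drops the last token ...
        dec_inputs = [sent[:-1] for sent in de_sents]
        # ... while the decoder label drops the first token and is mapped to
        # vocabulary IDs, so the model learns to predict the next token.
        dec_labels = [de_lookup_layer(sent[1:]) for sent in de_sents]
        prepared[name] = (enc_inputs, dec_inputs, dec_labels)
    return prepared
```

Calling prepare_data(de_lookup_layer, train_xy, valid_xy, test_xy) then yields, for each split, the inputs consumed by the encoder and the decoder, plus the labels against which the loss is computed.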