Natural Language Processing with TensorFlow/

...

Use Case: Implementing BERT

Learn to implement BERT to answer questions.

We'll cover the following...

Implementing and using the tokenizer
Defining a TensorFlow dataset
BERT for answering questions
Defining the config and the model

To use a pretrained transformer model from the Hugging Face repository, we need three components:

Tokenizer: Responsible for splitting a long bit of text (such as a sentence) into smaller tokens.
config: Contains the configuration of the model.
Model: Takes in the tokens, looks up the embeddings, and produces the final outputs using the provided inputs.

We can ignore the config because we’re using the pretrained model as is. However, to show all aspects of this process, we’ll use the configuration nevertheless.

Implementing and using the tokenizer

First, we’ll look at how to download the tokenizer. We can do this using the transformers library. Simply call the from_pretrained() function provided by the PreTrainedTokenizerFast base class:

Let’s look at the arguments we’ve provided to the tokenizer’s call:

text: A single or batch of text sequences to be encoded by the tokenizer. Each text sequence is a string.
text_pair: An optional single or batch of text sequences to be encoded by the tokenizer. It’s useful in situations where the model takes a multipart input (such as a question and a context in question answering).
padding: Indicates the padding strategy. If set to True, it will be padded to the maximum sequence length in the dataset. If set to max_length, it will be padded to the length specified by the max_length argument. If set to False, no padding will be done.
return_tensors: An argument that defines the type of tensors returned. It could be either pt ...

Introduction to Natural Language Processing

Understanding TensorFlow 2

Word2vec: Learning Word Embeddings

Advanced Word Vector Algorithms

Sentence Classification with Convolutional Neural Networks

Recurrent Neural Networks

Understanding Long Short-Term Memory Networks

Applications of LSTM: Generating Text

Sequence-to-Sequence Learning: Neural Machine Translation

Transformers

Sarcasm Classification Using BERT

Image Captioning with Transformers

Caption Generation Using PyTorch

Final Remarks

Appendix: Mathematical Foundations and Advanced TensorFlow

Use Case: Implementing BERT

Implementing and using the tokenizer