Named Entity Recognition with RNNs: Training and Evaluation
Explore how to train and evaluate recurrent neural networks for named entity recognition tasks. Understand the challenges of class imbalance and learn to apply macro-averaged accuracy for fair metric evaluation. Gain the skills to prepare sample weights that balance frequent and rare classes, and use them to improve model training and validation.
Evaluation metrics and the loss function
During our previous discussion, we alluded to the fact that NER tasks carry a high class imbalance. It’s quite normal for text to contain far more non-entity tokens than entity tokens, which leads to a large number of “other” (O) labels and comparatively few labels of the remaining types; the short sketch after the following list makes this concrete. We need to take this into account both when training and when evaluating the model. We’ll address the class imbalance in two ways:
We’ll create a new evaluation metric that is resilient to class imbalance.
We’ll use sample weights to penalize more frequent classes and boost the importance of rare classes.
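To make that imbalance concrete, here is a minimal sketch that tallies the labels of a hypothetical, made-up CoNLL-style tag sequence (the tags and counts are purely illustrative, not from a real corpus):

```python
from collections import Counter

# Hypothetical CoNLL-style tags for a short text (illustrative only):
# most tokens fall outside any entity and receive the "O" label.
tags = [
    "O", "O", "B-PER", "I-PER", "O", "O", "O", "O", "B-LOC", "O",
    "O", "O", "O", "O", "B-ORG", "I-ORG", "O", "O", "O", "O",
]

counts = Counter(tags)
total = sum(counts.values())
for tag, count in counts.most_common():
    print(f"{tag:>6}: {count} ({count / total:.0%})")
```

Even in this tiny example, three-quarters of the labels are O; on real corpora the skew is typically even stronger.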
In this lesson, we’ll only address the former; the latter will be addressed in the next lesson. We’ll define a modified version of accuracy called macro-averaged accuracy. In macro averaging, we compute the accuracy for each class separately and then average those per-class values, so every class contributes equally to the final score regardless of how many samples it has. When computing standard metrics like accuracy, precision, or recall, there are several types of averaging available.
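As a minimal sketch of that idea (plain NumPy here, not necessarily the exact implementation used later in this course), macro-averaged accuracy can be computed by measuring accuracy on each class’s samples separately and then taking the unweighted mean:

```python
import numpy as np

def macro_accuracy(y_true, y_pred):
    """Mean of per-class accuracies, so each class contributes equally."""
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    per_class = [
        np.mean(y_pred[y_true == cls] == cls)  # accuracy on this class's samples alone
        for cls in np.unique(y_true)
    ]
    return float(np.mean(per_class))

# Imbalanced toy labels: class 0 ("other") dominates.
y_true = [0, 0, 0, 0, 0, 0, 0, 0, 1, 2]
y_pred = [0, 0, 0, 0, 0, 0, 0, 0, 0, 0]  # degenerate model: always predicts 0

print(np.mean(np.asarray(y_true) == np.asarray(y_pred)))  # standard accuracy: 0.8
print(macro_accuracy(y_true, y_pred))                     # macro accuracy: ~0.33
```

Standard accuracy rewards the degenerate always-predict-"other" model, while macro accuracy exposes it. Because the per-class accuracy here is accuracy restricted to one class’s samples (that class’s recall), the same value can also be obtained with scikit-learn’s balanced_accuracy_score.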
Different types of metric averaging
The scikit-learn documentation describes these averaging strategies. Consider a simple binary classification example with the following confusion matrix results:
Micro: Computes a global metric by counting the total true positives, false positives, and false negatives over all classes, without distinguishing between classes; every sample counts equally, so frequent classes dominate the result, e.g.,
...
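To see how the choice of averaging plays out, here is a small hypothetical comparison (the labels are made up, and recall_score is used only as a representative metric) of micro and macro averaging in scikit-learn:

```python
from sklearn.metrics import recall_score

# Hypothetical imbalanced labels: class 0 dominates.
y_true = [0, 0, 0, 0, 0, 0, 1, 1, 2, 2]
# Perfect on the majority class, only 50% on each rare class.
y_pred = [0, 0, 0, 0, 0, 0, 0, 1, 0, 2]

# Micro: pools all samples, so the majority class drives the score.
print(recall_score(y_true, y_pred, average="micro"))  # 0.8
# Macro: averages per-class recall, so each class counts equally.
print(recall_score(y_true, y_pred, average="macro"))  # ~0.67
```

Micro averaging rewards the perfect majority-class predictions, while macro averaging surfaces the weaker performance on the rare classes, which is exactly the behavior we want when evaluating imbalanced NER models.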