
BERTSUM for Extractive Summarization

Explore how to apply BERTSUM for extractive summarization by classifying important sentences with a simple classifier, inter-sentence transformer, or LSTM. Understand how to fine-tune the pre-trained BERT model jointly with the summarization layer for effective text summarization tasks.

In extractive summarization, we create a summary by selecting only the important sentences from the given text. To perform extractive summarization, we obtain the representation of every sentence in the given text using a pre-trained BERT model.
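To make this concrete, here is a minimal sketch of obtaining sentence representations, assuming the Hugging Face transformers library and the bert-base-uncased checkpoint (both are assumptions for illustration). Note that BERTSUM itself inserts a [CLS] token before every sentence of the document; this sketch simply encodes each sentence on its own and takes its [CLS] vector:

```python
import torch
from transformers import BertModel, BertTokenizer

# Assumption: bert-base-uncased from the Hugging Face transformers library.
# BERTSUM prepends a [CLS] token to every sentence of the input document;
# here we approximate that by encoding each sentence separately and taking
# its [CLS] hidden state as the sentence representation R_i.
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased")
model.eval()

sentences = [
    "Paris is the capital of France.",
    "It is known for the Eiffel Tower.",
    "Millions of tourists visit the city every year.",
]

with torch.no_grad():
    representations = []
    for sentence in sentences:
        inputs = tokenizer(sentence, return_tensors="pt")
        outputs = model(**inputs)
        # Hidden state of the [CLS] token (position 0) -> representation R_i
        representations.append(outputs.last_hidden_state[:, 0, :])

R = torch.cat(representations)
print(R.shape)  # torch.Size([3, 768]): one 768-dimensional vector per sentence
```

Each row of R is the representation of one sentence; these are the vectors that the summarization layer scores in the approaches below.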

Now let's see how to use BERTSUM in the following three ways:

  • BERTSUM with a simple classifier

  • BERTSUM with an inter-sentence transformer

  • BERTSUM with LSTM

BERTSUM with a classifier

We feed the representation of a sentence to a simple binary classifier, and the classifier tells us whether the sentence is important or not. That is, the classifier returns the probability of the sentence being included in the summary. The classification layer is often called the summarization layer. This is shown in the following figure:

Figure: BERTSUM with a classifier

From the preceding figure, we can observe that we feed all the sentences from the given text to the pre-trained BERT model. The pre-trained BERT model returns the representation of each sentence, $R_1, R_2, \ldots, R_i, \ldots, R_n$. Then we feed these representations to the classifier (summarization layer), which returns the probability of each sentence being included in the summary.

For each sentence $i$ in the document, we get the sentence representation $R_i$ and feed it to the summarization layer, which returns the probability $\hat{Y}_i$ of including the sentence in the summary:

$$\hat{Y}_i = \sigma(W R_i + b)$$

From the preceding equation, we can observe that we are using a simple sigmoid classifier to obtain the probability $\hat{Y}_i$, where $\sigma$ is the sigmoid function and $W$ and $b$ are the weight and bias of the summarization layer, learned jointly while fine-tuning the pre-trained BERT model.
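As an illustration, here is a minimal sketch of this summarization layer as a single sigmoid-activated linear unit, matching the equation above. The class name SummarizationLayer and the random stand-in representations are hypothetical, introduced only for this example:

```python
import torch
import torch.nn as nn

class SummarizationLayer(nn.Module):
    """Hypothetical summarization layer: Y_hat_i = sigmoid(W @ R_i + b)."""

    def __init__(self, hidden_size: int = 768):
        super().__init__()
        self.linear = nn.Linear(hidden_size, 1)  # weight W and bias b

    def forward(self, R: torch.Tensor) -> torch.Tensor:
        # R: [n_sentences, hidden_size] sentence representations from BERT
        # Returns: [n_sentences] probabilities of inclusion in the summary
        return torch.sigmoid(self.linear(R)).squeeze(-1)

summarizer = SummarizationLayer()
R = torch.randn(3, 768)  # stand-in for the sentence representations R_1..R_n
Y_hat = summarizer(R)

# Build the summary from, say, the two highest-scoring sentences
top_sentences = torch.topk(Y_hat, k=2).indices
print(Y_hat, top_sentences)
```

During training, these probabilities are compared against binary labels indicating whether each sentence belongs in the reference summary, and the resulting loss fine-tunes the summarization layer and the pre-trained BERT model jointly.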