
Text and Token Classification

Explore how text and token classification work in natural language processing using Hugging Face transformers. Understand tokenization, embeddings, logits, and model outputs to apply classification for tasks like intent detection, NER, and zero-shot classification with hands-on Python examples.

Classification is one of the central pillars of natural language processing.

Whenever an ML model attempts to determine the category, meaning, tone, or role of something, it is performing classification. This seemingly simple operation powers most real-world NLP applications, including filtering spam, detecting toxic content, routing customer support tickets, analyzing sentiment in user reviews, and even determining whether a sentence contradicts or supports another.

Broadly, classification appears in two forms: text classification (labeling the entire passage) and token classification (labeling individual words or subwords). While the two may seem similar, they address very different problems and require distinct modeling strategies.

Fun fact: The earliest classification systems in NLP in the 1990s were rule-based and brittle. Today’s transformer models outperform them by enormous margins without needing handcrafted rules.

How classification models actually work

Most tutorials demonstrate pipelines without explaining how the model arrives at its prediction. Understanding the core components (tokenization, transformer layers, logits, and softmax) will dramatically improve your ability to debug, interpret, and optimize your models.

Tokenization

When text enters a model, it is first split into tokens using a tokenizer such as WordPiece, BPE, or SentencePiece.

Tokenization involves more than splitting text by spaces or punctuation. Modern tokenizers divide words into smaller subword units, which helps models understand rare or unfamiliar words without increasing the vocabulary size.

For example, “unbelievable” might be tokenized to:

"unbelievable" → ["un", "##believable"]
Subword tokenization example

Subword tokenization offers a robust approach to handling misspellings, compound words, and morphological variants, while maintaining a manageable vocabulary size.
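
You can inspect this behavior directly with a Hugging Face tokenizer. The sketch below is a minimal example assuming the bert-base-cased checkpoint; the exact subword split depends on each model's vocabulary, so your output may differ from the illustration above.

from transformers import AutoTokenizer

# Load a WordPiece tokenizer (bert-base-cased is an assumption; any checkpoint works)
tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")

print(tokenizer.tokenize("unbelievable"))   # subword pieces, e.g. ['un', '##believable']
print(tokenizer.tokenize("tokenization"))   # rarer words are split into several pieces
Inspecting subword tokenization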

Subword splitting also affects how you interpret token-level outputs: an entity may appear across multiple sub-tokens and must be aggregated back into a single span. Tokenization intersects with linguistic preprocessing techniques such as lemmatization and stemming.

The former reduces words to their base or dictionary form, while the latter chops off word endings. For example, "running" and "better" could be stemmed and lemmatized, respectively, as follows.

"running" → "run"
"better" → "good"
Stemming and lemmatization example
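
If you want to reproduce these normalizations in code, the sketch below uses NLTK (an assumption; this library is not part of the lesson's Transformers examples) and requires the WordNet data download shown in the comments.

import nltk
from nltk.stem import PorterStemmer, WordNetLemmatizer

nltk.download("wordnet", quiet=True)  # one-time download of the WordNet lemma data

print(PorterStemmer().stem("running"))                    # 'run'  (stemming chops the suffix)
print(WordNetLemmatizer().lemmatize("better", pos="a"))   # 'good' (lemmatization maps to the adjective's dictionary form)
Stemming and lemmatization with NLTK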
Quiz: Why do tokenizers break words into subwords?


Embedding + Transformer layers

After tokenization, each token is converted into a dense vector (an embedding).

These vectors pass through stacked transformer layers where self-attention computes how each token should attend to every other token in the input. This is what gives transformers their power: they create contextualized token representations, meaning the embedding for a token depends on the entire sentence.

Due to their self-attention mechanism, transformers can handle long-range dependencies, resolve ambiguities, and detect constructs like negation and sarcasm that rely on distant words. Practically, this is why a model can understand that “not bad” implies a positive sentiment even though the word “bad” is present.
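
A small experiment makes the idea of contextualized representations concrete. The sketch below assumes the bert-base-uncased checkpoint and shows that the vector for the token "bank" changes with its sentence.

import torch
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")

sentences = ["She sat by the river bank.", "He deposited cash at the bank."]
bank_id = tokenizer.convert_tokens_to_ids("bank")

with torch.no_grad():
    for sentence in sentences:
        inputs = tokenizer(sentence, return_tensors="pt")
        position = inputs["input_ids"][0].tolist().index(bank_id)   # locate the "bank" token
        vector = model(**inputs).last_hidden_state[0, position]     # its contextual embedding
        print(sentence, "->", vector[:3])                           # first few dimensions differ per sentence
Contextualized embeddings for the same token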

Fun fact: Self-attention was originally inspired by memory networks and alignment models in translation.

Logits

At the top of the network, the model outputs logits, which are raw numerical scores for each possible label.

Logits are not probabilities; they are unbounded numbers that the model uses internally to rank classes. The relative differences between logits are what matter. If you’re building thresholds, calibrating models, or diagnosing why a model is uncertain, examining logits is often more informative than simply examining the final label.
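
To see raw logits, you can call a sequence-classification model directly instead of going through a pipeline. This minimal sketch assumes the distilbert-base-uncased-finetuned-sst-2-english checkpoint recommended later in this lesson.

import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

model_name = "distilbert-base-uncased-finetuned-sst-2-english"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)

inputs = tokenizer("The movie was not bad at all.", return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits   # raw, unbounded scores with shape (batch_size, num_labels)
print("Raw logits:", logits)
Inspecting raw logits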

Softmax + Label mapping

Logits are converted to a pseudo-probability distribution using the softmax function. The class with the highest resulting probability becomes the predicted label, and Hugging Face returns a compact dictionary such as:

{'label': 'POSITIVE', 'score': 0.9987}

Knowing this final step makes it easier to reason about confidence: a high score near 1.0 suggests strong model agreement, while values closer to 0.5 indicate uncertainty and a need for caution (e.g., human review).
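
Continuing the logits sketch from the previous subsection (same model and logits variables), the final step applies softmax and maps the winning index back to a label name through the model's id2label configuration.

probabilities = torch.softmax(logits, dim=-1)       # normalize logits into a distribution
predicted_id = int(probabilities.argmax(dim=-1))    # index of the highest-probability class
label = model.config.id2label[predicted_id]         # e.g. {0: 'NEGATIVE', 1: 'POSITIVE'}
print({"label": label, "score": round(float(probabilities[0, predicted_id]), 4)})
Softmax and label mapping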

Fun fact: The softmax function originates from statistical mechanics and models the distribution of energy across states.

Quiz: Why should we care about logits if we only need labels?


Text classification

Text classification assigns one (or sometimes several) labels to an entire passage. This is a high-level operation: the model uses context across the whole input to make a single decision that summarizes intent, sentiment, topic, or action. Because it treats the input holistically, text classification is most effective when the label depends on the full text rather than a specific word or phrase.

Examples and explanation

  • Intent detection: Chatbots use text classification to map user utterances to intents like reset_password or check_balance. Intent models must be sensitive to short conversational phrases and be robust to paraphrase.

  • Topic classification: Newsrooms and aggregators classify long documents to route content to vertical teams. Topic models may require domain-specific vocabularies and sometimes hierarchical labels.

  • Toxicity detection: Moderation systems use classifiers to flag abusive language; these systems are often multi-label because a single text can be insulting, hateful, and sexually explicit at the same time.

Note about models: Always pick a model aligned with your task. For example, a sentiment model fine-tuned on movie reviews may underperform on social media text; a model trained for toxicity on social networks will better capture abusive slang and emoji usage.
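
As a concrete starting point, the sketch below runs a whole-passage sentiment classifier; it assumes the distilbert-base-uncased-finetuned-sst-2-english checkpoint listed in the model table later in this lesson.

from transformers import pipeline

sentiment_classifier = pipeline(
    "text-classification",
    model="distilbert-base-uncased-finetuned-sst-2-english",
)
result = sentiment_classifier("The support team resolved my issue within minutes.")
print("Sentiment:", result)   # e.g. [{'label': 'POSITIVE', 'score': 0.99...}]
Running a text classification pipeline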

Quiz: Why do toxicity models return multiple labels instead of one?


Natural language inference (NLI)

NLI is the task of deciding whether a hypothesis is entailed by, contradicted by, or neutral with respect to a given premise.

It is a reasoning-style classification rather than a surface-level label. NLI systems are extremely useful in fact-checking, determining whether an answer is supported by a source text, and improving search ranking by checking semantic fit.

Where NLI shines is verification: rather than generating an answer, an NLI model judges the relationship between two pieces of text. Because of this, models trained on MNLI (the "M" stands for Multi-Genre), such as roberta-large-mnli, are effective building blocks for zero-shot classification as well.

from transformers import pipeline

nli_pipeline = pipeline('text-classification', model='roberta-large-mnli')
premise = "The man is playing a guitar."
hypothesis = "The man is making music."
# MNLI-style models expect the premise first, followed by the hypothesis
nli_input = f"{premise} </s></s> {hypothesis}"
result = nli_pipeline(nli_input)
print("NLI Result:", result)
Initializing an NLI pipeline

Explanation: The pipeline combines the premise and hypothesis into a single input, feeds it to the RoBERTa MNLI model, and outputs whether the hypothesis is entailed by, contradicted by, or neutral with respect to the premise.

The result shows the predicted label and confidence score.

Fun fact: NLI models are a hidden backbone in many search and retrieval systems. They help filter which documents truly support a query, rather than just containing similar keywords.

Zero-shot classification

Zero-shot classification uses models trained on NLI-style tasks to score candidate labels without any task-specific training.

You provide human-readable category names and the model ranks them by semantic fit. This is transformative for rapid prototyping and production environments where labeled data is expensive or slow to produce. The trick is that NLI-trained models can compare a label description to the input text in a semantically meaningful way.

When to use zero-shot: When you want flexible categories, need to prototype quickly, or expect labels to change frequently. It is not always as accurate as a model fine-tuned for a specific domain, so consider validation on a small sample if this will drive production decisions.

zero_shot_classifier = pipeline('zero-shot-classification', model='facebook/bart-large-mnli')
candidate_labels = ["finance", "sports", "politics", "technology"]
result = zero_shot_classifier("Tesla plans to open a new factory in Mexico.", candidate_labels)
print("Zero-Shot Classification Result:", result)
Initializing a zero-shot classifier

Explanation: Zero-shot classification assigns a text to one or more labels without needing task-specific training. The input sentence is compared against the candidate_labels, and the model predicts which label(s) best describe the text, returning a score for each.

Quiz: Why is zero-shot classification a breakthrough?


Token classification

Token classification assigns labels to each token (word or subword). This is a local task: instead of summarizing the whole input, token classifiers identify and label important spans inside the input. Token classification is the right choice when you need structured outputs such as names, dates, monetary amounts, and other discrete pieces of information.

Named entity recognition (NER)

NER is the prototypical token classification task.

Modern NER models identify spans representing people, organizations, locations, dates, and more, and return character offsets, allowing you to extract exact text spans. Aggregation strategies (e.g., "simple") merge subword predictions into whole-word spans so the output is human-friendly. The "simple" aggregation strategy combines consecutive tokens with the same entity label into a single entity.

For instance, "John Doe works in New York" would combine "John" and "Doe" into a single entity "John Doe" (Person).

ner_pipeline = pipeline('token-classification', model='dslim/bert-base-NER', aggregation_strategy='simple')
ner_text = "Barack Obama visited Paris during the G20 summit."
entities = ner_pipeline(ner_text)
print("NER Entities:", entities)
NER pipeline with aggregation

Explanation: This pipeline identifies and extracts entities from text.

The model labels tokens (like names, locations, and organizations), and aggregation_strategy='simple' merges consecutive tokens of the same entity type, returning a clean list of recognized entities with their labels. Typical use cases include:

  • Extracting patient names and medications from medical records

  • Retrieving invoice amounts and vendor names for accounting automation

  • Redacting personal data from documents to comply with privacy regulations

Note: The first NER shared tasks in the early 2000s had accuracies in the 60–70% range; transformer-based models now routinely exceed 90%.

Part-of-Speech (PoS) tagging

PoS tagging labels grammatical roles. Although it is a classic NLP application, PoS tags remain valuable in pipelines that need linguistic structure, such as grammar checking, rule-based extraction, or downstream syntactic analysis. (Downstream systems are applications or tasks that use the output of an earlier natural language processing model as their input.)

Transformers outperform older statistical methods in this task due to their better context modeling capabilities.
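
PoS tagging uses the same token-classification pipeline as NER, just with a PoS-fine-tuned checkpoint. The model name below is an assumption for illustration; substitute any PoS model from the Hub.

from transformers import pipeline

pos_pipeline = pipeline(
    "token-classification",
    model="vblagoje/bert-english-uncased-finetuned-pos",   # assumed PoS checkpoint; replace as needed
    aggregation_strategy="simple",
)
print(pos_pipeline("Transformers handle long-range dependencies well."))
PoS tagging with the token-classification pipeline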

Quiz: Is PoS tagging still useful with deep learning?


Choosing between text and token classification

Choosing the correct classification level depends on the information you need.

If your goal is to route, label, or summarize documents, text classification is the right abstraction. If you need to extract structured fields or redact personal data, token classification is required. Often, the best systems are hybrid: a text classifier first determines the document type, and then token-level models extract the fields that matter for that type.

| Feature | Text Classification | Token Classification |
| --- | --- | --- |
| Label target | Whole sentence/document | Individual words/subwords |
| Best for | Sentiment, topics, intent, NLI | NER, PoS, entity extraction |
| Granularity | Global | Fine-grained |
| Output format | Single label or distribution | List of labeled spans |
| Typical use case | Email filtering, toxicity detection | Invoice parsing, medical text extraction |
| Can they be combined? | Yes | Yes |
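
The hybrid pattern described above can be sketched by chaining two pipelines; the zero-shot router and NER checkpoint below reuse models introduced earlier in this lesson, and the document string is purely illustrative.

from transformers import pipeline

router = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")
ner = pipeline("token-classification", model="dslim/bert-base-NER", aggregation_strategy="simple")

document = "Invoice from Acme Corp: Jane Smith approved a payment of 4,500 USD in Berlin."
doc_type = router(document, ["invoice", "support ticket", "news article"])["labels"][0]

# Only run the token-level extraction step for document types that need structured fields
if doc_type == "invoice":
    print("Extracted entities:", ner(document))
Combining text and token classification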

Selecting the right models

Selecting a model requires balancing domain alignment, language coverage, latency, and compute budget.

General-purpose top performers in 2025 include DeBERTa-v3 and RoBERTa-large for accuracy, DistilBERT for speed, and XLM-R for multilingual coverage. However, domain-specific checkpoints (finance, legal, biomedical) can dramatically outperform general models on niche text.

| Task Type | Recommended Models | Notes |
| --- | --- | --- |
| General classification | microsoft/deberta-v3-large, roberta-large | Highest accuracy |
| Sentiment analysis | distilbert-base-uncased-finetuned-sst-2-english | Fast and lightweight |
| NLI / Zero-shot | roberta-large-mnli, facebook/bart-large-mnli, microsoft/deberta-v3-large | Best reasoning performance |
| NER | dbmdz/bert-large-cased-finetuned-conll03-english, microsoft/deberta-v3-base | Strong span accuracy |
| Multilingual | xlm-roberta-large | Robust for non-English text |
| Low-latency apps | distilbert-base-uncased, google/electra-small-discriminator | Ideal for real-time inference |

Note: DistilBERT keeps around 97% of BERT's performance while being about 40% smaller, an excellent trade-off for production.

Quiz: Should we use domain-specific models?


Try it yourself

You now have all the necessary code inside the accompanying Jupyter notebook. Run each cell step by step and observe what happens. As you execute the notebook, you will see how each pipeline behaves on real text.

Use this Answer to get the access token for Hugging Face. Add the token value in the first cell of the Jupyter notebook and then run all cells.


Summary

This lesson introduced the core ideas behind text and token classification with hands-on practice.

Running these examples equips you with practical experience across the most important Hugging Face classification workflows. By experimenting with different models, analyzing predictions, and evaluating performance on your own data, you’ll deepen both intuition and technical understanding. As you explore tokenization behavior and confidence scores, you’ll begin to recognize why certain errors occur and how to fix them.

This hands-on practice sets the foundation for more advanced NLP experimentation ahead.