Home/Blog/Data Science/Data Science Simplified: Top 5 NLP tasks that use Hugging Face

Data Science Simplified: Top 5 NLP tasks that use Hugging Face

4 min read

Oct 28, 2020

content

Sentiment Analysis

Question Answering

Text Generation

Summarization

Translation

What to learn next

Continue reading about NLP and ML

Hugging Face is a company devoted to the development of NLP technologies and democratization of artificial intelligence through natural language technologies. Their teams have changed the way we approach NLP by providing easy-to-understand language model architectures.

The Hugging Face Transformers pipeline is an easy way to perform different NLP tasks. It can be used to solve a variety of NLP projects with state-of-the-art strategies and technologies.

Today, I want to introduce you to the Hugging Face pipeline by showing you the top 5 tasks you can achieve with their tools.

Today, we will go over:

Sentiment Analysis
Question Answering
Text Generation
Summarization
Translation
What to learn next

Learn the techniques for solving real NLP problems.
This course teaches the top techniques for processing text data, creating word embeddings, and using LSTM networks for NLP tasks.

Natural Language Processing with Machine Learning

nlp = pipeline("question-answering")
context = r"""
The property of being prime (or not) is called primality.
A simple but slow method of verifying the primality of a given number n is known as trial division.
It consists of testing whether n is a multiple of any integer between 2 and itself.
Algorithms much more efficient than trial division have been devised to test the primality of large numbers.
These include the Miller–Rabin primality test, which is fast but has a small probability of error, and the AKS primality test, which always produces the correct answer in polynomial time but is too slow to be practical.
Particularly fast methods are available for numbers of special forms, such as Mersenne numbers.
As of January 2016, the largest known prime number has 22,338,618 decimal digits.
"""
#Question 1
result = nlp(question="What is a simple method to verify primality?", context=context)
print(f"Answer: '{result['answer']}'")
#Question 2
result = nlp(question="As of January 2016 how many digits does the largest known prime consist of?", context=context)
print(f"Answer: '{result['answer']}'")

The output for the above code is:

A person must always work hard and be prepared to do so. The following are some of the things that you should do to help yoursefl: 1. Be prepared to work hard. 2. Be prepared to work hard.

summarizer = pipeline("summarization")
ARTICLE = """The Apollo program, also known as Project Apollo, was the third United States human spaceflight program carried out by the National Aeronautics and Space Administration (NASA), which accomplished landing the first humans on the Moon from 1969 to 1972.
First conceived during Dwight D. Eisenhower's administration as a three-man spacecraft to follow the one-man Project Mercury which put the first Americans in space,
Apollo was later dedicated to President John F. Kennedy's national goal of "landing a man on the Moon and returning him safely to the Earth" by the end of the 1960s, which he proposed in a May 25, 1961, address to Congress. 
Project Mercury was followed by the two-man Project Gemini (1962–66). 
The first manned flight of Apollo was in 1968.
Apollo ran from 1961 to 1972, and was supported by the two-man Gemini program which ran concurrently with it from 1962 to 1966. 
Gemini missions developed some of the space travel techniques that were necessary for the success of the Apollo missions.
Apollo used Saturn family rockets as launch vehicles. 
Apollo/Saturn vehicles were also used for an Apollo Applications Program, which consisted of Skylab, a space station that supported three manned missions in 1973–74, and the Apollo–Soyuz Test Project, a joint Earth orbit mission with the Soviet Union in 1975.
 """
summary=summarizer(ARTICLE, max_length=130, min_length=30, do_sample=False)[0]
print(summary['summary_text'])

The summary generated for the above paragraph is:

The Apollo program, also known as Project Apollo, was the third U.S. human spaceflight program carried out by the National Aeronautics and Space Administration (NASA) The first manned flight of Apollo was in 1968. The program was dedicated to President Kennedy's national goal of "landing a man on the Moon and returning him safely to the Earth"

The translated sentence is:

Ein großes Hindernis für das Glück besteht darin, zu viel Glück zu erwarten.

What to learn next#

NLP is a powerful tool, and there is so much to learn. If you are interested in exploring NLP on your own or designing projects using Hugging Face, consider starting with the following concepts:

Embeddings

Language Models

Bidirectional LSTM

Seq2Seq Models

and more

Check out Educative’s course Natural Language Processing with Machine Learning to get started with these topics and beyond. You’ll learn the techniques for processing text data, creating word embeddings, and using LSTM networks for NLP tasks. After completing this course, you will be able to solve the important day-to-day NLP problems on your own.

Happy learning!

Continue reading about NLP and ML#

Data Science Simplified: What is language modeling for NLP?

What is Natural Language Processing? Recent advances in the field

Crack the top 40 machine learning interview questions

Written By:
Aman Anand

Free AI Mock Interviews

Coding Interview
Coding PatternsFree Interview
Gain insights and practical experience with coding patterns through targeted MCQs and coding problems, designed to match and challenge your expertise level.
System Design
YouTubeFree Interview
Learn to design a video streaming platform like YouTube by tackling functional and non-functional requirements, core components, and high-level to detailed design challenges.

Free Resources

Data Science Simplified: Top 5 NLP tasks that use Hugging Face

Learn the techniques for solving real NLP problems.
This course teaches the top techniques for processing text data, creating word embeddings, and using LSTM networks for NLP tasks.

Natural Language Processing with Machine Learning

Sentiment Analysis#

Question Answering#

Text Generation#

Summarization#

Translation#

What to learn next#

Continue reading about NLP and ML#

Data Science Simplified: Top 5 NLP tasks that use Hugging Face

Learn the techniques for solving real NLP problems. This course teaches the top techniques for processing text data, creating word embeddings, and using LSTM networks for NLP tasks. Natural Language Processing with Machine Learning

Sentiment Analysis#

Question Answering#

Text Generation#

Summarization#

Translation#

What to learn next#

Continue reading about NLP and ML#

Learn the techniques for solving real NLP problems.
This course teaches the top techniques for processing text data, creating word embeddings, and using LSTM networks for NLP tasks.

Natural Language Processing with Machine Learning