Overview of spaCy Conventions

Let's look at the conventions followed by spaCy.

We'll cover the following

Overview of spaCy

Every NLP application consists of several steps of processing the text. As we saw previously, we have always created instances called nlp and doc. But what did we do exactly?

When we call nlp on our text, spaCy applies some processing steps. The first step is tokenization to produce a Doc object. The Doc object is then processed further with a tagger, a parser, and an entity recognizer. This way of processing the text is called a language processing pipeline. Each pipeline component returns the processed Doc and then passes it to the next component:

Create a free account to view this lesson.

By signing up, you agree to Educative's Terms of Service and Privacy Policy