Overview: Core Operations with spaCy

Let's look at what we will be learning in this section.

We will learn the core operations with spaCy, such as creating a language pipeline, tokenizing the text, and breaking the text into its sentences.

First, we'll learn what a language processing pipeline is and the pipeline components. We'll continue with general spaCy conventions—important classes and class organization—to help us to better understand spaCy library organization and develop a solid understanding of the library itself.

We'll then learn about the first pipeline component—Tokenizer. We'll also learn about an important linguistic concept—lemmatization—along with its applications in natural language understanding (NLU). Following that, we will cover container classes and spaCy data structures in detail. We will finish the chapter with useful spaCy features that we'll use in everyday NLP development.

Create a free account to view this lesson.

By signing up, you agree to Educative's Terms of Service and Privacy Policy