Installing spaCy's Statistical Models

Let's learn how we can install statistical models locally.

We'll cover the following

Overview

The spaCy installation doesn't come with the statistical language models needed for the spaCy pipeline tasks. spaCy language models contain knowledge about a specific language collected from a set of resources. Language models let us perform a variety of NLP tasks, including POS tagging and named-entity recognition (NER).

Different languages have different models and are language specific. There are also different models available for the same language. We'll see the differences between those models in detail in the Pro tip at the end of this section, but basically, the training data is different. The underlying statistical algorithm is the same. Some of the currently supported languages are as follows:

Get hands-on with 1200+ tech skills courses.