Entity Extraction

Explore how to implement entity extraction in a chatbot's natural language understanding pipeline using spaCy. Learn to identify and filter key entities including city, date, time, phone numbers, and cuisine types with spaCy's NER model and the Matcher class to support accurate semantic parsing.

We'll cover the following...

Extracting city entities
Extracting DATE and TIME entities
Extracting phone numbers
Extracting cuisine types

In this code segment, we listed all named entities of this utterance by calling doc.ents. Then, we examined the entity labels by calling ent.label_. Examining the output, we see that this utterance contains five entities: one CARDINAL entity (2), one TIME entity (11:30 am), one PRODUCT entity (Bird, which is not an ideal label for a restaurant), one GPE entity (Palo Alto), and one DATE entity (today). The GPE entity is what we're looking for: Palo Alto is a city in the US and hence is labeled by the spaCy NER model as GPE.
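The pattern described above, iterating over doc.ents and reading ent.label_, can be sketched as follows. This is a minimal illustration, not the lesson's original code. To keep it runnable without downloading a trained model, it assumes a blank English pipeline with a rule-based entity_ruler standing in for the statistical NER model. The utterance, the patterns, and the helper name entities_with_labels are all hypothetical.

```python
import spacy

# ASSUMPTION: a blank pipeline plus a rule-based EntityRuler stands in for
# a trained NER model, so the example runs without a model download.
nlp = spacy.blank("en")
ruler = nlp.add_pipe("entity_ruler")
ruler.add_patterns([
    {"label": "TIME", "pattern": "11:30 am"},
    {"label": "GPE", "pattern": "Palo Alto"},
    {"label": "DATE", "pattern": "today"},
])


def entities_with_labels(doc):
    """Return (text, label) pairs for every entity in doc.ents."""
    return [(ent.text, ent.label_) for ent in doc.ents]


# Hypothetical utterance, loosely modeled on the entities discussed above.
doc = nlp("Book a table for 2 at 11:30 am in Palo Alto today.")
for text, label in entities_with_labels(doc):
    print(text, label)
```

With a trained pipeline loaded through spacy.load, the same loop applies unchanged, but the entities and labels returned depend on the model, so they may differ from what the hand-written rules produce here.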

The code below outputs all the utterances that include a city entity together with the city entities. From the output of this script, we can see that the spaCy NER model performs very well on ...
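A rough sketch of this filtering step is shown here. It is not the lesson's original script: the utterances, the patterns, and the function name utterances_with_cities are hypothetical, and a rule-based entity_ruler on a blank pipeline is assumed in place of the trained NER model so the example stays self-contained.

```python
import spacy

# ASSUMPTION: rule-based stand-in for the trained NER model.
nlp = spacy.blank("en")
ruler = nlp.add_pipe("entity_ruler")
ruler.add_patterns([
    {"label": "GPE", "pattern": "Palo Alto"},
    {"label": "GPE", "pattern": "San Francisco"},
    {"label": "DATE", "pattern": "tomorrow"},
])

# Hypothetical utterances; the lesson's dataset is not reproduced here.
utterances = [
    "Find me a restaurant in Palo Alto.",
    "Book a table for tomorrow.",
    "Any good sushi places in San Francisco?",
]


def utterances_with_cities(nlp, texts):
    """Return (utterance, [city, ...]) for utterances with a GPE entity."""
    results = []
    for doc in nlp.pipe(texts):
        # Keep only entities labeled GPE, the label used for cities.
        cities = [ent.text for ent in doc.ents if ent.label_ == "GPE"]
        if cities:
            results.append((doc.text, cities))
    return results


city_hits = utterances_with_cities(nlp, utterances)
for text, cities in city_hits:
    print(text, cities)
```

Note that in spaCy's trained English models GPE also covers countries and states, so a real pipeline may need an extra check if only cities are wanted.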
