Search⌘ K
AI Features

Training Data Generation

Explore how to generate training data for entity linking by leveraging open-source datasets like CoNLL-2003 and AIDA CoNLL-YAGO, along with manual labeling for specialized domains. Understand the importance of domain knowledge in annotating entities and linking them to knowledge bases to improve system accuracy.

We'll cover the following...

There are two approaches you can adopt to gather training data for the entity linking problem.

  1. Open-source datasets
  2. Manual labeling

You can use one or both depending on the particular task for which we have to perform entity linking.

Open-source datasets

If the task is not extremely domain-specific and does not require very specific tags, you can avail ...