HomeCoursesAn Introduction to Entity Resolution in Python
AI-powered learning
Save

An Introduction to Entity Resolution in Python

Explore entity resolution in Python, including use cases, semantic preprocessing, graph clustering, and weak supervision. Boost business value with hands-on coding and strategic decisions.

5.0
63 Lessons
2 Projects
8h
Join 2.9 million developers at
Join 2.9 million developers at
LEARNING OBJECTIVES
  • The ability to deduplicate records using Python
  • Familiarity with an entity resolution framework and business cases
  • An understanding of semantic similarity and search
  • Experience with classification in the context of entity resolution
  • Hands-on experience in data-centric AI using weak supervision and confident learning

Learning Roadmap

63 Lessons1 Project7 Quizzes1 Assessment

1.

Introduction to Entity Resolution and Applications

Introduction to Entity Resolution and Applications

Learn how to use entity resolution techniques in Python to improve data quality and integration.

2.

A Quickstart Guide Using the RecordLinkage Package

A Quickstart Guide Using the RecordLinkage Package

Get started with entity resolution in Python using text preprocessing, similarity scoring, and evaluation techniques.

3.

Preprocessing

Preprocessing

7 Lessons

7 Lessons

Work your way through preprocessing text and location data for enhanced entity resolution.

4.

Indexing

Indexing

6 Lessons

6 Lessons

Apply your skills to enhance the efficiency of entity resolution using various indexing techniques.

5.

Feature Engineering

Feature Engineering

5 Lessons

5 Lessons

Deepen your knowledge of feature engineering for entity resolution, exploring various similarity methods.

6.

Pairwise Matching

Pairwise Matching

12 Lessons

12 Lessons

See how it works to tackle class imbalances and enhance binary classification in entity resolution.

7.

Clustering

Clustering

6 Lessons

6 Lessons

Piece together the parts of clustering techniques to improve classification accuracy in entity resolution.

8.

Integration

Integration

8 Lessons

8 Lessons

Step through integrating entity resolution in data stacks via various platforms and services.

10.

Appendix

Appendix

3 Lessons

3 Lessons

Examine batch geocoding, vector search with LanceDB, and essential resources for entity resolution.
Certificate of Completion
Showcase your accomplishment by sharing your certificate of completion.
Author NameAn Introduction to EntityResolution in Python
Developed by MAANG Engineers
ABOUT THIS COURSE
A typical business stores data across multiple systems, including ERPs for operations, a CRM for marketing, files, notebooks, and BI apps for other purposes. Records of the same customer (entity) exist in multiple places, likely not in sync across nor unique within sources. This inconsistent situation generates an opportunity for us to drive business value by cross-referencing and deduplicating records with entity resolution. This course covers business acumen and hands-on coding. It starts with several business cases and a quick introduction to entity resolution in Python. Then, it explores semantic-preserving preprocessing, similarity feature engineering, graph clustering, weak supervision, confident learning, and integration. As a developer, you’ll increase your company’s business value by developing and deploying entity resolution pipelines. As a decision-maker, you’ll know which solution best suits your business cases and how to negotiate the best value for your money.
ABOUT THE AUTHOR

Paul Kinsvater

I graduated in 2016 with a PhD in Statistics. Since then, I have pursued a career in data science. My preferred niche is entity resolution specifically and data quality in general.

Learn more about Paul

Trusted by 2.9 million developers working at companies

These are high-quality courses. Trust me the price is worth it for the content quality. Educative came at the right time in my career. I'm understanding topics better than with any book or online video tutorial I've done. Truly made for developers. Thanks

A

Anthony Walker

@_webarchitect_

Just finished my first full #ML course: Machine learning for Software Engineers from Educative, Inc. ... Highly recommend!

E

Evan Dunbar

ML Engineer

You guys are the gold standard of crash-courses... Narrow enough that it doesn't need years of study or a full blown book to get the gist, but broad enough that an afternoon of Googling doesn't cut it.

S

Software Developer

Carlos Matias La Borde

I spend my days and nights on Educative. It is indispensable. It is such a unique and reader-friendly site

S

Souvik Kundu

Front-end Developer

Your courses are simply awesome, the depth they go into and the breadth of coverage is so good that I don't have to refer to 10 different websites looking for interview topics and content.

V

Vinay Krishnaiah

Software Developer

Built for 10x Developers

No Passive Learning
Learn by building with project-based lessons and in-browser code editor
Learn by Doing
Personalized Roadmaps
The platform adapts to your strengths & skills gaps as you go
Learn by Doing
Future-proof Your Career
Get hands-on with in-demand skills
Learn by Doing
AI Code Mentor
Write better code with AI feedback, smart debugging, and "Ask AI"
Learn by Doing
Learn by Doing
MAANG+ Interview Prep
AI Mock Interviews simulate every technical loop at top companies
Learn by Doing

Free Resources

FOR TEAMS

Interested in this course for your business or team?

Unlock this course (and 1,000+ more) for your entire org with DevPath