HomeCoursesAn Introduction to Entity Resolution in Python

Advanced

8h

Updated 3 months ago

An Introduction to Entity Resolution in Python
Save

Explore entity resolution in Python, including use cases, semantic preprocessing, graph clustering, and weak supervision. Boost business value with hands-on coding and strategic decisions.
Join 2.7 million developers at
Overview
Content
Reviews
Related
A typical business stores data across multiple systems, including ERPs for operations, a CRM for marketing, files, notebooks, and BI apps for other purposes. Records of the same customer (entity) exist in multiple places, likely not in sync across nor unique within sources. This inconsistent situation generates an opportunity for us to drive business value by cross-referencing and deduplicating records with entity resolution. This course covers business acumen and hands-on coding. It starts with several business cases and a quick introduction to entity resolution in Python. Then, it explores semantic-preserving preprocessing, similarity feature engineering, graph clustering, weak supervision, confident learning, and integration. As a developer, you’ll increase your company’s business value by developing and deploying entity resolution pipelines. As a decision-maker, you’ll know which solution best suits your business cases and how to negotiate the best value for your money.
A typical business stores data across multiple systems, including ERPs for operations, a CRM for marketing, files, notebooks, an...Show More

WHAT YOU'LL LEARN

The ability to deduplicate records using Python
Familiarity with an entity resolution framework and business cases
An understanding of semantic similarity and search
Experience with classification in the context of entity resolution
Hands-on experience in data-centric AI using weak supervision and confident learning
The ability to deduplicate records using Python

Show more

Content

1.

Introduction to Entity Resolution and Applications

7 Lessons

Learn how to use entity resolution techniques in Python to improve data quality and integration.

2.

A Quickstart Guide Using the RecordLinkage Package

8 Lessons

Get started with entity resolution in Python using text preprocessing, similarity scoring, and evaluation techniques.

4.

Indexing

6 Lessons

Apply your skills to enhance the efficiency of entity resolution using various indexing techniques.

5.

Feature Engineering

5 Lessons

Deepen your knowledge of feature engineering for entity resolution, exploring various similarity methods.

7.

Clustering

6 Lessons

Piece together the parts of clustering techniques to improve classification accuracy in entity resolution.

9.

Conclusion

1 Lessons

Look at essential insights and skills for effective entity resolution in various systems.

10.

Appendix

3 Lessons

Examine batch geocoding, vector search with LanceDB, and essential resources for entity resolution.
Certificate of Completion
Showcase your accomplishment by sharing your certificate of completion.

Course Author:

Developed by MAANG Engineers
Every Educative resource is designed by our team of ex-MAANG software engineers and PhD computer science educators — subject matter experts who’ve shipped production code at scale and taught the theory behind it. The goal is to get you hands-on with the skills you need to stay ahead in today's constantly evolving tech landscape. No videos, no fluff — just interactive, project-based learning with personalized feedback that adapts to your goals and experience.

Trusted by 2.7 million developers working at companies

Hands-on Learning Powered by AI

See how Educative uses AI to make your learning more immersive than ever before.

Instant Code Feedback

Evaluate and debug your code with the click of a button. Get real-time feedback on test cases, including time and space complexity of your solutions.

Adaptive Learning

Explain with AI

AI Code Mentor

Free Resources

FOR TEAMS

Interested in this course for your business or team?

Unlock this course (and 1,000+ more) for your entire org with DevPath