5.0
Advanced
8h
An Introduction to Entity Resolution in Python
Explore entity resolution in Python, including use cases, semantic preprocessing, graph clustering, and weak supervision. Boost business value with hands-on coding and strategic decisions.
A typical business stores data across multiple systems, including ERPs for operations, a CRM for marketing, files, notebooks, and BI apps for other purposes. Records of the same customer (entity) exist in multiple places and are likely neither in sync across sources nor unique within them. This inconsistency creates an opportunity to drive business value by cross-referencing and deduplicating records with entity resolution.
This course covers business acumen and hands-on coding. It starts with several business cases and a quick introduction to entity resolution in Python. Then, it explores semantic-preserving preprocessing, similarity feature engineering, graph clustering, weak supervision, confident learning, and integration.
As a developer, you’ll increase your company’s business value by developing and deploying entity resolution pipelines. As a decision-maker, you’ll know which solution best suits your business cases and how to negotiate the best value for your money.
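As a rough illustration of the deduplication idea described above, the following minimal sketch compares company names from two hypothetical systems using only Python's standard library. The record IDs, the names, the `normalize` and `similarity` helpers, and the 0.6 threshold are all assumptions made for this example; the course itself may use different tools and techniques.

```python
from difflib import SequenceMatcher
from itertools import combinations

# Hypothetical customer records from two systems (illustrative data only).
records = {
    "crm-1": "Acme Corp.",
    "erp-7": "ACME Corporation",
    "crm-2": "Globex Inc",
    "erp-9": "Initech LLC",
}


def normalize(name: str) -> str:
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    cleaned = "".join(
        ch if ch.isalnum() or ch.isspace() else " " for ch in name.lower()
    )
    return " ".join(cleaned.split())


def similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1] between two normalized names."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


THRESHOLD = 0.6  # assumed cutoff; a real pipeline would tune this on labeled pairs

# Compare every pair of records and keep those that look like the same entity.
matches = [
    (id_a, id_b)
    for (id_a, name_a), (id_b, name_b) in combinations(records.items(), 2)
    if similarity(name_a, name_b) >= THRESHOLD
]

print(matches)
```

With this sample data, only the two Acme records should be paired. Comparing every pair grows quadratically with the number of records, which is why a practical pipeline also needs indexing to limit the candidate pairs.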
WHAT YOU'LL LEARN
The ability to deduplicate records using Python
Familiarity with an entity resolution framework and business cases
An understanding of semantic similarity and search
Experience with classification in the context of entity resolution
Hands-on experience in data-centric AI using weak supervision and confident learning
Content
1.
Introduction to Entity Resolution and Applications
7 Lessons
Learn how to use entity resolution techniques in Python to improve data quality and integration.
2.
A Quickstart Guide Using the RecordLinkage Package
8 Lessons
Get started with entity resolution in Python using text preprocessing, similarity scoring, and evaluation techniques.
3.
Preprocessing
7 Lessons
Work your way through preprocessing text and location data for enhanced entity resolution.
4.
Indexing
6 Lessons
Apply your skills to enhance the efficiency of entity resolution using various indexing techniques.
5.
Feature Engineering
5 Lessons
Deepen your knowledge of feature engineering for entity resolution, exploring various similarity methods.
6.
Pairwise Matching
12 Lessons
Learn how to tackle class imbalance and improve binary classification in entity resolution.
7.
Clustering
6 Lessons
Combine clustering techniques to improve classification accuracy in entity resolution.
8.
Integration
8 Lessons
Step through integrating entity resolution in data stacks via various platforms and services.
9.
Conclusion
1 Lesson
Look at essential insights and skills for effective entity resolution in various systems.
10.
Appendix
3 Lessons
Examine batch geocoding, vector search with LanceDB, and essential resources for entity resolution.
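To make the indexing, pairwise matching, and clustering stages in the outline above more concrete, here is a minimal standard-library sketch. It assumes a toy blocking key (postal code), a character-similarity rule with an assumed 0.85 threshold, and connected-components clustering via union-find. The records, helper names, and threshold are hypothetical and are not taken from the course material.

```python
from collections import defaultdict
from difflib import SequenceMatcher
from itertools import combinations

# Hypothetical records: (id, normalized name, postal code); illustrative only.
records = [
    ("a1", "jon smith", "10001"),
    ("a2", "john smith", "10001"),
    ("a3", "john smyth", "10001"),
    ("b1", "maria garcia", "94105"),
    ("b2", "maria garcia", "94105"),
    ("c1", "john smith", "60601"),
]

THRESHOLD = 0.85  # assumed cutoff for calling two names a match


def is_match(rec_a, rec_b) -> bool:
    """Toy pairwise matcher: compare names by character similarity."""
    return SequenceMatcher(None, rec_a[1], rec_b[1]).ratio() >= THRESHOLD


# Indexing (blocking): only records sharing a postal code become candidates.
blocks = defaultdict(list)
for rec in records:
    blocks[rec[2]].append(rec)

# Union-find structure used to merge matched records into clusters.
parent = {rec[0]: rec[0] for rec in records}


def find(x: str) -> str:
    """Return the cluster representative of x, compressing the path."""
    while parent[x] != x:
        parent[x] = parent[parent[x]]
        x = parent[x]
    return x


def union(x: str, y: str) -> None:
    """Merge the clusters containing x and y."""
    parent[find(x)] = find(y)


comparisons = 0
for block in blocks.values():
    for rec_a, rec_b in combinations(block, 2):
        comparisons += 1
        if is_match(rec_a, rec_b):
            union(rec_a[0], rec_b[0])

# Clustering: group record ids by their representative (connected components).
groups = defaultdict(list)
for rec_id in parent:
    groups[find(rec_id)].append(rec_id)
clusters = sorted(sorted(members) for members in groups.values())

print(comparisons, clusters)
```

In this sketch, blocking reduces the work from 15 possible pairs to 4 comparisons, at the cost of never comparing `c1` with the other "john smith" records because the postal codes differ. Choosing a blocking key is a trade-off between efficiency and missed matches.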
Certificate of Completion
Showcase your accomplishment by sharing your certificate of completion.
Course Author:
Developed by MAANG Engineers