Dedupe🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
entity-embedPyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
splinkImplementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters
stanceLearned string similarity for entity names using optimal transport.
conciliatorOpenRefine reconciliation services for VIAF, ORCID, and Open Library + framework for creating more.
whatisWhatIs.this: simple entity resolution through Wikipedia
snowmanWelcome to Snowman App – a Data Matching Benchmark Platform.
zinggScalable identity resolution, entity resolution, data mastering and deduplication using ML