LibpostalA C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
Dedupe🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
entity-embedPyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
splinkImplementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters
stanceLearned string similarity for entity names using optimal transport.