Top 8 record-linkage open source projects

Libpostal
A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
Dedupe
A Python library for accurate and scalable fuzzy matching, record deduplication, and entity resolution.
record-linkage-resources
Resources for tackling record linkage / deduplication / data matching problems
entity-embed
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
splink
An implementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including an EM algorithm to estimate the model's parameters.
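
For readers unfamiliar with the Fellegi-Sunter model named in the splink entry, the following is a minimal sketch of how it scores a pair of records. It is not splink's API. The field names, the m/u probabilities, and the prior are hypothetical values chosen for illustration; in practice such parameters would be estimated, for example with EM.

```python
import math

# Hypothetical parameters, for illustration only.
# m = P(field agrees | the two records are a true match)
# u = P(field agrees | the two records are not a match)
M_U = {
    "first_name": (0.90, 0.010),
    "surname": (0.95, 0.005),
    "postcode": (0.80, 0.001),
}


def match_weight(agreements):
    """Sum the log2 likelihood ratios over all compared fields.

    `agreements` maps a field name to True (values agree) or False.
    Agreement contributes log2(m / u); disagreement contributes
    log2((1 - m) / (1 - u)), which is negative when m > u.
    """
    total = 0.0
    for field, agrees in agreements.items():
        m, u = M_U[field]
        if agrees:
            total += math.log2(m / u)
        else:
            total += math.log2((1 - m) / (1 - u))
    return total


def match_probability(weight, prior=0.001):
    """Convert a match weight into a posterior match probability.

    `prior` is the assumed probability that a random pair is a match.
    """
    prior_odds = prior / (1 - prior)
    posterior_odds = prior_odds * 2 ** weight
    return posterior_odds / (1 + posterior_odds)


if __name__ == "__main__":
    pair = {"first_name": True, "surname": True, "postcode": False}
    w = match_weight(pair)
    print(f"weight={w:.2f}, probability={match_probability(w):.4f}")
```

Pairs whose weight exceeds a chosen upper threshold are treated as matches, pairs below a lower threshold as non-matches, and the rest are left for manual review.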