All Categories → No Category → data-matching

Top 4 data-matching open source projects

record-linkage-resources
Resources for tackling record linkage / deduplication / data matching problems
entity-embed
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
splink
Implementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters
1-4 of 4 data-matching projects