GitPlanet
Projects
Users
Categories
Languages
About
All Categories
→
No Category
→ deduplicate-data
Top 1 deduplicate-data open source projects
splink
Implementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters
✭ 181
Roff
python
spark
record-linkage
entity-resolution
fuzzy-matching
deduplication
em-algorithm
data-matching
deduplicate-data
1-1
of
1
deduplicate-data projects