Top 4 data-matching open source projects
record-linkage-resources
Resources for tackling record linkage / deduplication / data matching problems
✭ 67
record-linkage
entity-resolution
deduplication
data-matching
entity-embed
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
✭ 96
Jupyter Notebook
python
deep-learning
record-linkage
entity-resolution
pytorch
embeddings
representation-learning
deduplication
entity-matching
data-matching
approximate-nearest-neighbors
splink
Implementation of the Fellegi-Sunter canonical model of record linkage in Apache Spark, including an EM algorithm to estimate parameters
✭ 181
Roff
python
spark
record-linkage
entity-resolution
fuzzy-matching
deduplication
em-algorithm
data-matching
deduplicate-data
snowman
Welcome to Snowman App – a Data Matching Benchmark Platform.
✭ 25
typescript
CSS
benchmark
matching
entity-resolution
duplicate-detection
kpis
snowman
data-matching
data-stewards