792 open source projects that are alternatives to or similar to zingg

splink
Implementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including an EM algorithm to estimate parameters
Stars: ✭ 181 (-72.37%)
yadf
Yet Another Dupes Finder
Stars: ✭ 32 (-95.11%)
Mutual labels:  dedupe, deduplication
Dedupe
🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
Stars: ✭ 3,241 (+394.81%)
Mutual labels:  dedupe, entity-resolution
Data Matching Software
A list of free data matching and record linkage software.
Stars: ✭ 206 (-68.55%)
Mutual labels:  fuzzy-matching, deduplication
mail-deduplicate
📧 CLI to deduplicate mails from mail boxes.
Stars: ✭ 134 (-79.54%)
Mutual labels:  dedupe, deduplication
dduper
Fast block-level out-of-band BTRFS deduplication tool.
Stars: ✭ 108 (-83.51%)
Mutual labels:  dedupe, deduplication
naas
⚙️ Schedule notebooks, run them like APIs, securely expose your assets: Jupyter as a viable ⚡️ Production environment
Stars: ✭ 219 (-66.56%)
Mutual labels:  etl, data-transformation
Talisman
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Stars: ✭ 584 (-10.84%)
Mutual labels:  fuzzy-matching, deduplication
record-linkage-resources
Resources for tackling record linkage / deduplication / data matching problems
Stars: ✭ 67 (-89.77%)
Mutual labels:  entity-resolution, deduplication
entity-embed
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
Stars: ✭ 96 (-85.34%)
Mutual labels:  entity-resolution, deduplication
DQCS
Data quality control system
Stars: ✭ 34 (-94.81%)
Mutual labels:  etl, dataquality
Restic
Fast, secure, efficient backup program
Stars: ✭ 15,105 (+2206.11%)
Mutual labels:  dedupe, deduplication
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-94.05%)
Mutual labels:  etl, datalake
gallia-core
A schema-aware Scala library for data transformation
Stars: ✭ 44 (-93.28%)
Mutual labels:  etl, data-transformation
YaEtl
Yet Another ETL in PHP
Stars: ✭ 60 (-90.84%)
Mutual labels:  etl
FlutterIOT
Visit our website for more Mobile and Web applications
Stars: ✭ 66 (-89.92%)
Mutual labels:  ml
BETL-old
BETL. Meta data driven ETL generation using T-SQL
Stars: ✭ 17 (-97.4%)
Mutual labels:  etl
ID-Card-Passport-Recognition-SDK-Android
On-Device ID Card & Passport & Driver License Recognition SDK for Android
Stars: ✭ 223 (-65.95%)
Mutual labels:  identity
neptune-client
📒 Experiment tracking tool and model registry
Stars: ✭ 348 (-46.87%)
Mutual labels:  ml
dask-sql
Distributed SQL Engine in Python using Dask
Stars: ✭ 271 (-58.63%)
Mutual labels:  ml
dlink
Dinky is an out of the box one-stop real-time computing platform dedicated to the construction and practice of Unified Streaming & Batch and Unified Data Lake & Data Warehouse. Based on Apache Flink, Dinky provides the ability to connect many big data frameworks including OLAP and Data Lake.
Stars: ✭ 1,535 (+134.35%)
Mutual labels:  datalake
RE-VERB
speaker diarization system using an LSTM
Stars: ✭ 22 (-96.64%)
Mutual labels:  ml
google-sheets-etl
Live import all your Google Sheets to your data warehouse
Stars: ✭ 15 (-97.71%)
Mutual labels:  etl
lm-scorer
📃Language Model based sentences scoring library
Stars: ✭ 264 (-59.69%)
Mutual labels:  ml
zdh server
zdh data collection platform, ETL processing service
Stars: ✭ 53 (-91.91%)
Mutual labels:  etl
zpaqfranz
Deduplicating archiver with encryption and paranoid-level tests. Swiss army knife for the serious backup and disaster recovery manager. Ransomware neutralizer. Win/Linux/Unix
Stars: ✭ 86 (-86.87%)
Mutual labels:  deduplication
neural inverse knitting
Code for Neural Inverse Knitting: From Images to Manufacturing Instructions
Stars: ✭ 30 (-95.42%)
Mutual labels:  ml
blockstack.js-old
The Blockstack JS library for identity and authentication
Stars: ✭ 20 (-96.95%)
Mutual labels:  identity
cogito
Cogito Identity Management https://cogito.mobi
Stars: ✭ 14 (-97.86%)
Mutual labels:  identity
predict Lottery ticket
AI prediction for the Double Color Ball and Super Lotto lotteries
Stars: ✭ 341 (-47.94%)
Mutual labels:  ml
fuzzychinese
A small package to fuzzy match Chinese words
Stars: ✭ 50 (-92.37%)
Mutual labels:  fuzzy-matching
Hacktoberfest-2k19
Just add pull requests to this repo and stand a chance to win a limited edition Hacktoberfest T-shirt.
Stars: ✭ 33 (-94.96%)
Mutual labels:  ml
django-data-migration
Data migration framework for Django that migrates legacy data into your new Django app
Stars: ✭ 18 (-97.25%)
Mutual labels:  etl
fuzzywuzzy
Fuzzy string matching for PHP
Stars: ✭ 60 (-90.84%)
Mutual labels:  fuzzy-matching
DeepBump
Normal & height maps generation from single pictures
Stars: ✭ 185 (-71.76%)
Mutual labels:  ml
apiary-data-lake
Terraform scripts for deploying Apiary Data Lake
Stars: ✭ 15 (-97.71%)
Mutual labels:  datalake
poa-popa
DApp for proof of physical address (PoPA) attestation for validators of POA Network
Stars: ✭ 22 (-96.64%)
Mutual labels:  identity
socrates
PHP package to Validate and Extract information from National Identification Numbers.
Stars: ✭ 46 (-92.98%)
Mutual labels:  identity
morph-kgc
Powerful RDF Knowledge Graph Generation with [R2]RML Mappings
Stars: ✭ 77 (-88.24%)
Mutual labels:  etl
winter
WInte.r is a Java framework for end-to-end data integration. The WInte.r framework implements well-known methods for data pre-processing, schema matching, identity resolution, data fusion, and result evaluation.
Stars: ✭ 101 (-84.58%)
Mutual labels:  identity-resolution
Yoyo-leaf
Yoyo-leaf is an awesome command-line fuzzy finder.
Stars: ✭ 49 (-92.52%)
Mutual labels:  fuzzy-matching
card-scanner-flutter
A Flutter package for fast, accurate and secure credit card and debit card scanning
Stars: ✭ 82 (-87.48%)
Mutual labels:  ml
mlapp
MLApp is a Python library for building scalable data science solutions that meet modern software engineering standards.
Stars: ✭ 42 (-93.59%)
Mutual labels:  ml
CustomVisionMicrosoftToCoreMLDemoApp
This app recognises 3 hand signs - fist, high five and victory hand [ rock, paper, scissors basically :) ] with live feed camera. It uses a HandSigns.mlmodel which has been trained using Custom Vision from Microsoft.
Stars: ✭ 25 (-96.18%)
Mutual labels:  ml
Learning-Resources
This repository contains curated, useful resources drafted by DSC Domain Leads.
Stars: ✭ 21 (-96.79%)
Mutual labels:  ml
openrefine-batch
Shell script to run OpenRefine in batch mode (import, transform, export). It orchestrates OpenRefine (server) and a python client that communicates with the OpenRefine API.
Stars: ✭ 76 (-88.4%)
Mutual labels:  etl
active-directory-android
An Android app that uses Azure AD and the ADAL library for authenticating the user and calling a web API using OAuth 2.0 access tokens.
Stars: ✭ 33 (-94.96%)
Mutual labels:  identity
AspNetCore.Identity.RavenDB
RavenDB Storage Provider for ASP.NET Core Identity
Stars: ✭ 16 (-97.56%)
Mutual labels:  identity
r2inference
RidgeRun Inference Framework
Stars: ✭ 22 (-96.64%)
Mutual labels:  ml
DevSoc21
Official website for DEVSOC 21, our annual flagship hackathon.
Stars: ✭ 15 (-97.71%)
Mutual labels:  ml
facematch
Facematch is a tool that verifies whether two photos contain the same person.
Stars: ✭ 62 (-90.53%)
Mutual labels:  identity
ml-graphlab-boilerplate
Machine learning boiler plate to get you started in minutes (graphlab + sframe + jupyter + docker)
Stars: ✭ 17 (-97.4%)
Mutual labels:  ml
Authentication
Authentication examples for AspNetCore 3.1
Stars: ✭ 37 (-94.35%)
Mutual labels:  identity
FlowMaster
ETL flow framework based on Yaml configs in Python
Stars: ✭ 19 (-97.1%)
Mutual labels:  etl
zdh web
Big data collection and extraction platform
Stars: ✭ 292 (-55.42%)
Mutual labels:  etl
identity-site
This is the Login.gov main website where the public is able to learn about their one account for government.
Stars: ✭ 28 (-95.73%)
Mutual labels:  identity
django-super-deduper
Utilities for de-duping Django model instances
Stars: ✭ 27 (-95.88%)
Mutual labels:  dedupe
identityazuretable
This project provides a high performance cloud solution for ASP.NET Identity Core using Azure Table storage replacing the Entity Framework / MSSQL provider.
Stars: ✭ 97 (-85.19%)
Mutual labels:  identity
leetspeek
Open and collaborative content from leet hackers!
Stars: ✭ 11 (-98.32%)
Mutual labels:  ml
opus
No description or website provided.
Stars: ✭ 22 (-96.64%)
Mutual labels:  identity