bedaBeda is a golang library for detecting how similar a two string
Stars: β 34 (+25.93%)
strutilGolang metrics for calculating string similarity and other string utility functions
Stars: β 114 (+322.22%)
Dedupeπ A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
Stars: β 3,241 (+11903.7%)
LevenshteinThe Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity
Stars: β 38 (+40.74%)
fuzzywuzzyFuzzy string matching for PHP
Stars: β 60 (+122.22%)
record-linkage-resourcesResources for tackling record linkage / deduplication / data matching problems
Stars: β 67 (+148.15%)
strsimstring similarity based on Dice's coefficient in go
Stars: β 39 (+44.44%)
Merge-MachineMerge Dirty Data with Clean Reference Tables
Stars: β 35 (+29.63%)
entity-embedPyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
Stars: β 96 (+255.56%)
splinkImplementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters
Stars: β 181 (+570.37%)
vmoPython Modules of Variable Markov Oracle
Stars: β 23 (-14.81%)
zinggScalable identity resolution, entity resolution, data mastering and deduplication using ML
Stars: β 655 (+2325.93%)
TeamReferenceTeam reference for Competitive Programming. Algorithms implementations very used in the ACM-ICPC contests. Latex template to build your own team reference.
Stars: β 29 (+7.41%)
vbmlWay to check, match and resist.
Stars: β 27 (+0%)
wildmatchSimple string matching with questionmark- and star-wildcard operator
Stars: β 37 (+37.04%)
sinkhorn-label-allocationSinkhorn Label Allocation is a label assignment method for semi-supervised self-training algorithms. The SLA algorithm is described in full in this ICML 2021 paper: https://arxiv.org/abs/2102.08622.
Stars: β 49 (+81.48%)
ATGValidatoriOS validation framework with form validation support
Stars: β 51 (+88.89%)
gotoA fish shell utility to quickly navigate to aliased directories supporting tab-completion
Stars: β 17 (-37.04%)
PFACPFAC is an open library for exact string matching performed on NVIDIA GPUs
Stars: β 41 (+51.85%)
stringbenchString matching algorithm benchmark
Stars: β 31 (+14.81%)
AwesomeStanceLearningThe page lists recent research developments in the area of Stance Learning.
Stars: β 42 (+55.56%)
LibpostalA C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
Stars: β 3,312 (+12166.67%)
MongeAmpereFlowContinuous-time gradient flow for generative modeling and variational inference
Stars: β 29 (+7.41%)
simplematchMinimal, super readable string pattern matching for python.
Stars: β 147 (+444.44%)
seqalignCollection of sequence alignment algorithms.
Stars: β 20 (-25.93%)
QuickenshteinMaking the quickest and most memory efficient implementation of Levenshtein Distance with SIMD and Threading support
Stars: β 204 (+655.56%)
alfBash Alias Generator and Manager
Stars: β 47 (+74.07%)
algosA collection of algorithms in rust
Stars: β 16 (-40.74%)
whatisWhatIs.this: simple entity resolution through Wikipedia
Stars: β 18 (-33.33%)
eddieNo description or website provided.
Stars: β 18 (-33.33%)
anonaddyMobile app for AnonAddy.com.
Stars: β 50 (+85.19%)
fuzzy-matchLibrary and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.
Stars: β 31 (+14.81%)
snowmanWelcome to Snowman App β a Data Matching Benchmark Platform.
Stars: β 25 (-7.41%)
artisan-aliasesSave keystrokes and run Artisan commands your way
Stars: β 23 (-14.81%)
Dotfilesβ rice ββ custom linux config files
Stars: β 1,514 (+5507.41%)
affinegapπ A Cython implementation of the affine gap string distance
Stars: β 57 (+111.11%)
conciliatorOpenRefine reconciliation services for VIAF, ORCID, and Open Library + framework for creating more.
Stars: β 95 (+251.85%)
effceeEffcee is a C++ library for stateful pattern matching of strings, inspired by LLVM's FileCheck
Stars: β 76 (+181.48%)
tipzGives you helpful hints when you execute a command for which you have an alias defined
Stars: β 24 (-11.11%)
stringdistanceA fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more..
Stars: β 60 (+122.22%)
fish-exaπ exa aliases for fish
Stars: β 24 (-11.11%)
ripzπ‘ ripgrep-powered zsh plugin alias reminder
Stars: β 23 (-14.81%)
dotfilesMy dotfiles
Stars: β 16 (-40.74%)
runnMake your own terminal aliases easily!
Stars: β 18 (-33.33%)
homesetupYour shell good as hell ! Not just dotfiles.
Stars: β 25 (-7.41%)
stringosimString similarity functions, String distance's, Jaccard, Levenshtein, Hamming, Jaro-Winkler, Q-grams, N-grams, LCS - Longest Common Subsequence, Cosine similarity...
Stars: β 47 (+74.07%)
multi string replaceA fast multiple string replace library for ruby. Uses a C implementation of the AhoβCorasick Algorithm based on https://github.com/morenice/ahocorasick while adding support for on the fly multiple string replacement. Faster alternative to String.gsub when dealing with non-regex (exact match) use cases
Stars: β 16 (-40.74%)
pymolshortcutsThe repository pymolschortucts contains the a collection of shortcuts that are loaded on startup of PyMOL. These shortcuts enable websearches from within PyMOL as well as many other convienent functions that make work in PyMOL more productive..
Stars: β 34 (+25.93%)
alyCommand Line Alias Manager and Plugin System - Written in Golang
Stars: β 21 (-22.22%)
backpack.bashrc over ssh
Stars: β 24 (-11.11%)
MongeAmpereSolve large instance of semi-discrete optimal transport problems and other Monge-Ampere equations
Stars: β 18 (-33.33%)
node-red-contrib-stringProvides a string manipulation node with a chainable UI based on the concise and lightweight stringjs.com.
Stars: β 15 (-44.44%)