All Projects → stringdistance → Similar Projects or Alternatives

110 Open source projects that are alternatives of or similar to stringdistance

strutil
Golang metrics for calculating string similarity and other string utility functions
Stars: ✭ 114 (+90%)
eddie
No description or website provided.
Stars: ✭ 18 (-70%)
stringosim
String similarity functions, String distance's, Jaccard, Levenshtein, Hamming, Jaro-Winkler, Q-grams, N-grams, LCS - Longest Common Subsequence, Cosine similarity...
Stars: ✭ 47 (-21.67%)
set-sketch-paper
SetSketch: Filling the Gap between MinHash and HyperLogLog
Stars: ✭ 23 (-61.67%)
Symspell
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Stars: ✭ 1,976 (+3193.33%)
Jellyfish
🎐 a python library for doing approximate and phonetic matching of strings.
Stars: ✭ 1,571 (+2518.33%)
Mutual labels:  levenshtein, jaro-winkler, soundex
levenshtein.c
Levenshtein algorithm in C
Stars: ✭ 77 (+28.33%)
edits.cr
Edit distance algorithms inc. Jaro, Damerau-Levenshtein, and Optimal Alignment
Stars: ✭ 16 (-73.33%)
Mutual labels:  levenshtein, jaro-winkler, jaro
spark-stringmetric
Spark functions to run popular phonetic and string matching algorithms
Stars: ✭ 51 (-15%)
Java String Similarity
Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
Stars: ✭ 2,403 (+3905%)
Textdistance
Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.
Stars: ✭ 2,575 (+4191.67%)
Levenshtein
The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity
Stars: ✭ 38 (-36.67%)
Fastenshtein
The fastest .Net Levenshtein around
Stars: ✭ 115 (+91.67%)
Mutual labels:  fuzzy-matching, levenshtein
Abydos
Abydos NLP/IR library for Python
Stars: ✭ 91 (+51.67%)
Mutual labels:  fuzzy-matching, levenshtein
Symspellpy
Python port of SymSpell
Stars: ✭ 420 (+600%)
Mutual labels:  fuzzy-matching, levenshtein
Closestmatch
Golang library for fuzzy matching within a set of strings 📃
Stars: ✭ 353 (+488.33%)
Mutual labels:  fuzzy-matching, levenshtein
String Similarity
Finds degree of similarity between two strings, based on Dice's Coefficient, which is mostly better than Levenshtein distance.
Stars: ✭ 2,254 (+3656.67%)
Fuzzball.js
Easy to use and powerful fuzzy string matching, port of fuzzywuzzy.
Stars: ✭ 225 (+275%)
Mutual labels:  fuzzy-matching, levenshtein
strsim
string similarity based on Dice's coefficient in go
Stars: ✭ 39 (-35%)
Quickenshtein
Making the quickest and most memory efficient implementation of Levenshtein Distance with SIMD and Threading support
Stars: ✭ 204 (+240%)
spellchecker-wasm
SpellcheckerWasm is an extrememly fast spellchecker for WebAssembly based on SymSpell
Stars: ✭ 46 (-23.33%)
tika-similarity
Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.
Stars: ✭ 92 (+53.33%)
ceja
PySpark phonetic and string matching algorithms
Stars: ✭ 24 (-60%)
Mutual labels:  jaro-winkler, hamming-distance
LinSpell
Fast approximate strings search & spelling correction
Stars: ✭ 52 (-13.33%)
simetric
String similarity metrics for Elixir
Stars: ✭ 59 (-1.67%)
Mutual labels:  levenshtein, jaro-winkler
Content-based-Recommender-System
It is a content based recommender system that uses tf-idf and cosine similarity for N Most SImilar Items from a dataset
Stars: ✭ 64 (+6.67%)
Mutual labels:  cosine-similarity
affinegap
📐 A Cython implementation of the affine gap string distance
Stars: ✭ 57 (-5%)
Mutual labels:  levenshtein-distance
Faint
Extensible TUI fuzzy file file explorer
Stars: ✭ 82 (+36.67%)
Mutual labels:  fuzzy-matching
Refinr
Cluster and merge similar char values: an R implementation of Open Refine clustering algorithms
Stars: ✭ 91 (+51.67%)
Mutual labels:  fuzzy-matching
Movie-Recommendation-System-with-Sentiment-Analysis
This is a Machine Learning project to create a "Movie Recommender System" and predict user ratings for movies using cosine similarity.
Stars: ✭ 21 (-65%)
Mutual labels:  cosine-similarity
bns-short-text-similarity
📖 Use Bi-normal Separation to find document vectors which is used to compute similarity for shorter sentences.
Stars: ✭ 24 (-60%)
Mutual labels:  cosine-similarity
Gitgot
Semi-automated, feedback-driven tool to rapidly search through troves of public data on GitHub for sensitive secrets.
Stars: ✭ 964 (+1506.67%)
Mutual labels:  fuzzy-matching
Fuse Swift
A lightweight fuzzy-search library, with zero dependencies
Stars: ✭ 767 (+1178.33%)
Mutual labels:  fuzzy-matching
Talisman
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Stars: ✭ 584 (+873.33%)
Mutual labels:  fuzzy-matching
fuzzychinese
A small package to fuzzy match chinese words
Stars: ✭ 50 (-16.67%)
Mutual labels:  fuzzy-matching
lsh-rs
Locality Sensitive Hashing in Rust with Python bindings
Stars: ✭ 64 (+6.67%)
Mutual labels:  cosine-similarity
koolsla
Food recommendation tool with Machine learning.
Stars: ✭ 21 (-65%)
Mutual labels:  cosine-similarity
Fuzzywuzzy
Java fuzzy string matching implementation of the well known Python's fuzzywuzzy algorithm. Fuzzy search for Java
Stars: ✭ 506 (+743.33%)
Mutual labels:  fuzzy-matching
spaczz
Fuzzy matching and more functionality for spaCy.
Stars: ✭ 215 (+258.33%)
Mutual labels:  fuzzy-matching
Persian Tools
An anthology of a variety of tools for the Persian language in javascript
Stars: ✭ 458 (+663.33%)
Mutual labels:  fuzzy-matching
double-metaphone
Fast Double Metaphone algorithm
Stars: ✭ 70 (+16.67%)
Mutual labels:  soundex
levenshtein-edit-distance
Levenshtein edit distance
Stars: ✭ 59 (-1.67%)
Mutual labels:  levenshtein
Liquidmetal
💦🤘 A mimetic poly-alloy of the Quicksilver scoring algorithm, essentially LiquidMetal. </Schwarzenegger Voice>
Stars: ✭ 279 (+365%)
Mutual labels:  fuzzy-matching
Re Flex
The regex-centric, fast lexical analyzer generator for C++ with full Unicode support. Faster than Flex. Accepts Flex specifications. Generates reusable source code that is easy to understand. Introduces indent/dedent anchors, lazy quantifiers, functions for lex/syntax error reporting, and more. Seamlessly integrates with Bison and other parsers.
Stars: ✭ 274 (+356.67%)
Mutual labels:  fuzzy-matching
vue-persian-tools
Persian tools wrapper for vue.js
Stars: ✭ 21 (-65%)
Mutual labels:  fuzzy-matching
SymSpellCppPy
Fast SymSpell written in c++ and exposes to python via pybind11
Stars: ✭ 28 (-53.33%)
Mutual labels:  fuzzy-matching
edit-distance-papers
A curated list of papers dedicated to edit-distance as objective function
Stars: ✭ 49 (-18.33%)
Mutual labels:  levenshtein
solr-vector-scoring
Vector Plugin for Solr: calculate dot product / cosine similarity on documents
Stars: ✭ 28 (-53.33%)
Mutual labels:  cosine-similarity
fuzzywuzzy
Fuzzy string matching for PHP
Stars: ✭ 60 (+0%)
Mutual labels:  fuzzy-matching
levenshtein finder
Similar string search in Levenshtein distance
Stars: ✭ 19 (-68.33%)
Mutual labels:  levenshtein
splink
Implementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters
Stars: ✭ 181 (+201.67%)
Mutual labels:  fuzzy-matching
fuzzy-search
A collection of algorithms for fuzzy search like in Sublime Text.
Stars: ✭ 49 (-18.33%)
Mutual labels:  fuzzy-matching
fuzzy-matcher
Fuzzy Matching Library for Rust
Stars: ✭ 140 (+133.33%)
Mutual labels:  fuzzy-matching
bolt.nvim
⚡ Ultrafast multi-pane file manager for Neovim with fuzzy matching
Stars: ✭ 100 (+66.67%)
Mutual labels:  fuzzy-matching
Tntsearch
A fully featured full text search engine written in PHP
Stars: ✭ 2,693 (+4388.33%)
Mutual labels:  fuzzy-matching
fish-fzy
fzy inegration with fish. Search history, navigate directories and more. Blazingly fast.
Stars: ✭ 18 (-70%)
Mutual labels:  fuzzy-matching
fuzzy-match
Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.
Stars: ✭ 31 (-48.33%)
Mutual labels:  fuzzy-matching
Data Matching Software
A list of free data matching and record linkage software.
Stars: ✭ 206 (+243.33%)
Mutual labels:  fuzzy-matching
Pg similarity
set of functions and operators for executing similarity queries
Stars: ✭ 250 (+316.67%)
Mutual labels:  levenshtein
seqalign pathing
Rust implementation of sequence alignment / Levenshtein distance by A* acceleration of the DP algorithm
Stars: ✭ 17 (-71.67%)
Mutual labels:  levenshtein-distance
1-60 of 110 similar projects