stringosim: String similarity functions, string distances, Jaccard, Levenshtein, Hamming, Jaro-Winkler, Q-grams, N-grams, LCS - Longest Common Subsequence, Cosine similarity...
Stars: ✭ 47 (+147.37%)
strutil: Golang metrics for calculating string similarity and other string utility functions
Stars: ✭ 114 (+500%)
Quickenshtein: Making the quickest and most memory efficient implementation of Levenshtein Distance with SIMD and Threading support
Stars: ✭ 204 (+973.68%)
Pg similarity: Set of functions and operators for executing similarity queries
Stars: ✭ 250 (+1215.79%)
Fuzzball.js: Easy to use and powerful fuzzy string matching, port of fuzzywuzzy.
Stars: ✭ 225 (+1084.21%)
Strsim Rs: 🔤 Rust implementations of string similarity metrics
Stars: ✭ 210 (+1005.26%)
Textdistance: Compute distance between sequences. 30+ algorithms, pure Python implementation, common interface, optional use of external libraries.
Stars: ✭ 2,575 (+13452.63%)
Symspell (SymSpell): 1 million times faster spelling correction & fuzzy search through the Symmetric Delete spelling correction algorithm
Stars: ✭ 1,976 (+10300%)
Levenshtein: Go implementation to calculate Levenshtein Distance.
Stars: ✭ 125 (+557.89%)
Dictomaton: Finite state dictionaries in Java
Stars: ✭ 124 (+552.63%)
Fastenshtein: The fastest .NET Levenshtein around
Stars: ✭ 115 (+505.26%)
Jellyfish: 🎐 a Python library for doing approximate and phonetic matching of strings.
Stars: ✭ 1,571 (+8168.42%)
Abydos: Abydos NLP/IR library for Python
Stars: ✭ 91 (+378.95%)
Stopwords: Removes the most frequent words (stop words) from text content. Based on a curated list of language statistics.
Stars: ✭ 83 (+336.84%)
Str metrics: Ruby gem (native extension in Rust) providing implementations of various string metrics
Stars: ✭ 68 (+257.89%)
Edit Distance: Python library for computing edit distance between arbitrary Python sequences.
Stars: ✭ 61 (+221.05%)
Symspellcompound (SymSpellCompound): compound-aware automatic spelling correction
Stars: ✭ 61 (+221.05%)
Levenshtein: Levenshtein distance and similarity metrics with customizable edit costs and Winkler-like bonus for common prefix.
Stars: ✭ 57 (+200%)
Rapidfuzz: Rapid fuzzy string matching in Python using the Levenshtein Distance
Stars: ✭ 809 (+4157.89%)
Stringmetric: 🎯 String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gram, NYSIIS, Overlap, Ratcliff/Obershelp, Refined NYSIIS, Refined Soundex, Soundex, Weighted Levenshtein).
Stars: ✭ 481 (+2431.58%)
Symspellpy: Python port of SymSpell
Stars: ✭ 420 (+2110.53%)
Closestmatch: Golang library for fuzzy matching within a set of strings 📃
Stars: ✭ 353 (+1757.89%)
Js Levenshtein: The most efficient JS implementation for calculating the Levenshtein distance, i.e. the difference between two strings.
Stars: ✭ 269 (+1315.79%)
Go Edlib: Golang string comparison and edit distance algorithms library, featuring: Levenshtein, LCS, Hamming, Damerau-Levenshtein (OSA and Adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc.
Stars: ✭ 253 (+1231.58%)
similar-english-words: Give me a word and I’ll give you an array of words that differ by a single letter.
Stars: ✭ 25 (+31.58%)
hubot-suggest: Suggest Hubot commands when a command is not found
Stars: ✭ 29 (+52.63%)
simetric: String similarity metrics for Elixir
Stars: ✭ 59 (+210.53%)
Levenshtein: The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity
Stars: ✭ 38 (+100%)
edits.cr: Edit distance algorithms including Jaro, Damerau-Levenshtein, and Optimal Alignment
Stars: ✭ 16 (-15.79%)
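java-sdk: A collection of commonly used Java SDKs and utility classes (date utilities, distributed locks, Redis cache, binary tree, reflection utilities, thread pool, symmetric/asymmetric/segmented encryption and decryption, JSON serialization, HTTP utilities, snowflake algorithm, string similarity, collection utilities, XML parsing, retry utilities, JVM monitoring, etc.)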
Stars: ✭ 26 (+36.84%)
LinSpell: Fast approximate string search & spelling correction
Stars: ✭ 52 (+173.68%)
spellchecker-wasm: SpellcheckerWasm is an extremely fast spellchecker for WebAssembly based on SymSpell
Stars: ✭ 46 (+142.11%)
stringdistance: A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more.
Stars: ✭ 60 (+215.79%)
edit-distance-papers: A curated list of papers dedicated to edit distance as an objective function
Stars: ✭ 49 (+157.89%)
eddie: No description or website provided.
Stars: ✭ 18 (-5.26%)
Java String Similarity: Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-Winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
Stars: ✭ 2,403 (+12547.37%)
stance: Learned string similarity for entity names using optimal transport.
Stars: ✭ 27 (+42.11%)
beda: Beda is a Golang library for detecting how similar two strings are
Stars: ✭ 34 (+78.95%)
seqalign: Collection of sequence alignment algorithms.
Stars: ✭ 20 (+5.26%)
fuzzywuzzy: Fuzzy string matching for PHP
Stars: ✭ 60 (+215.79%)
affinegap: 📐 A Cython implementation of the affine gap string distance
Stars: ✭ 57 (+200%)