Java String Similarity
Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-Winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
Stars: ✭ 2,403 (+4115.79%)
Quickenshtein
Making the quickest and most memory-efficient implementation of Levenshtein Distance with SIMD and Threading support
Stars: ✭ 204 (+257.89%)
Textdistance
Compute distance between sequences. 30+ algorithms, pure Python implementation, common interface, optional external libs usage.
Stars: ✭ 2,575 (+4417.54%)
Symspell
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Stars: ✭ 1,976 (+3366.67%)
text2text
Text2Text: Cross-lingual natural language processing and generation toolkit
Stars: ✭ 188 (+229.82%)
customized-symspell
Java port of SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm
Stars: ✭ 51 (-10.53%)
polyleven
Fast Levenshtein Distance Library for Python 3
Stars: ✭ 37 (-35.09%)
Levenshtein
The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity
Stars: ✭ 38 (-33.33%)
LinSpell
Fast approximate string search & spelling correction
Stars: ✭ 52 (-8.77%)
spellchecker-wasm
SpellcheckerWasm is an extremely fast spellchecker for WebAssembly based on SymSpell
Stars: ✭ 46 (-19.3%)
stringdistance
A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more.
Stars: ✭ 60 (+5.26%)
seqalign pathing
Rust implementation of sequence alignment / Levenshtein distance by A* acceleration of the DP algorithm
Stars: ✭ 17 (-70.18%)
stance
Learned string similarity for entity names using optimal transport.
Stars: ✭ 27 (-52.63%)
beda
Beda is a Golang library for detecting how similar two strings are
Stars: ✭ 34 (-40.35%)
seqalign
Collection of sequence alignment algorithms.
Stars: ✭ 20 (-64.91%)
stringosim
String similarity and string distance functions: Jaccard, Levenshtein, Hamming, Jaro-Winkler, Q-grams, N-grams, LCS (Longest Common Subsequence), Cosine similarity...
Stars: ✭ 47 (-17.54%)
strutil
Golang metrics for calculating string similarity and other string utility functions
Stars: ✭ 114 (+100%)
fuzzywuzzy
Fuzzy string matching for PHP
Stars: ✭ 60 (+5.26%)