stringosimString similarity functions, String distance's, Jaccard, Levenshtein, Hamming, Jaro-Winkler, Q-grams, N-grams, LCS - Longest Common Subsequence, Cosine similarity...
Stars: ✭ 47 (-20.34%)
Mutual labels: distance, levenshtein, jaro-winkler
Pg similarityset of functions and operators for executing similarity queries
Stars: ✭ 250 (+323.73%)
Mutual labels: similarity, levenshtein
stringdistanceA fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more..
Stars: ✭ 60 (+1.69%)
Mutual labels: levenshtein, jaro-winkler
strutilGolang metrics for calculating string similarity and other string utility functions
Stars: ✭ 114 (+93.22%)
Mutual labels: levenshtein, jaro-winkler
edits.crEdit distance algorithms inc. Jaro, Damerau-Levenshtein, and Optimal Alignment
Stars: ✭ 16 (-72.88%)
Mutual labels: levenshtein, jaro-winkler
Node Damerau LevenshteinDamerau - Levenstein distance function for node
Stars: ✭ 27 (-54.24%)
Mutual labels: similarity, levenshtein
similar-english-wordsGive me a word and I’ll give you an array of words that differ by a single letter.
Stars: ✭ 25 (-57.63%)
Mutual labels: distance, levenshtein
Jellyfish🎐 a python library for doing approximate and phonetic matching of strings.
Stars: ✭ 1,571 (+2562.71%)
Mutual labels: levenshtein, jaro-winkler
StopwordsRemoves most frequent words (stop words) from a text content. Based on a Curated list of language statistics.
Stars: ✭ 83 (+40.68%)
Mutual labels: distance, levenshtein
TextdistanceCompute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.
Stars: ✭ 2,575 (+4264.41%)
Mutual labels: distance, levenshtein
Java String SimilarityImplementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
Stars: ✭ 2,403 (+3972.88%)
Mutual labels: distance, jaro-winkler
Stringmetric🎯 String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gram, NYSIIS, Overlap, Ratcliff/Obershelp, Refined NYSIIS, Refined Soundex, Soundex, Weighted Levenshtein).
Stars: ✭ 481 (+715.25%)
Mutual labels: distance, levenshtein
levenshtein.cLevenshtein algorithm in C
Stars: ✭ 77 (+30.51%)
Mutual labels: similarity, levenshtein
eddieNo description or website provided.
Stars: ✭ 18 (-69.49%)
Mutual labels: levenshtein, jaro-winkler
SkeletonMatchingThis repository implements skeleton matching algorithm.
Stars: ✭ 30 (-49.15%)
Mutual labels: distance
BertSimilarityComputing similarity of two sentences with google's BERT algorithm。利用Bert计算句子相似度。语义相似度计算。文本相似度计算。
Stars: ✭ 348 (+489.83%)
Mutual labels: similarity
dice-coefficientSørensen–Dice coefficient
Stars: ✭ 37 (-37.29%)
Mutual labels: similarity
java-sdk一些常用的java sdk和工具类(日期工具类,分布式锁,redis缓存,二叉树,反射工具类,线程池,对称/非对称/分段加解密,json序列化,http工具,雪花算法,字符串相似度,集合操作工具,xml解析,重试Retry工具类,Jvm监控等)
Stars: ✭ 26 (-55.93%)
Mutual labels: levenshtein
nxontologyNetworkX-based Python library for representing ontologies
Stars: ✭ 45 (-23.73%)
Mutual labels: similarity