strutil: Golang metrics for calculating string similarity and other string utility functions
Stars: ✭ 114 (+235.29%)
stance: Learned string similarity for entity names using optimal transport.
Stars: ✭ 27 (-20.59%)
strsim: String similarity based on Dice's coefficient in Go
Stars: ✭ 39 (+14.71%)
Levenshtein: The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity
Stars: ✭ 38 (+11.76%)
fuzzywuzzy: Fuzzy string matching for PHP
Stars: ✭ 60 (+76.47%)
seqalign: Collection of sequence alignment algorithms.
Stars: ✭ 20 (-41.18%)
stringosim: String similarity and string distance functions: Jaccard, Levenshtein, Hamming, Jaro-Winkler, Q-grams, N-grams, LCS (Longest Common Subsequence), Cosine similarity...
Stars: ✭ 47 (+38.24%)
TeamReference: Team reference for competitive programming. Implementations of algorithms frequently used in ACM-ICPC contests. LaTeX template to build your own team reference.
Stars: ✭ 29 (-14.71%)
wildmatch: Simple string matching with question-mark and star wildcard operators
Stars: ✭ 37 (+8.82%)
AnyDiff: A CSharp (C#) diff library that allows you to diff two objects and get a list of the differences back.
Stars: ✭ 80 (+135.29%)
hyperdiff: Find common, removed, and added elements between two collections.
Stars: ✭ 14 (-58.82%)
effcee: Effcee is a C++ library for stateful pattern matching of strings, inspired by LLVM's FileCheck
Stars: ✭ 76 (+123.53%)
algos: A collection of algorithms in Rust
Stars: ✭ 16 (-52.94%)
fuzzy-match: Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.
Stars: ✭ 31 (-8.82%)
stringdistance: A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more.
Stars: ✭ 60 (+76.47%)
node-red-contrib-string: Provides a string manipulation node with a chainable UI based on the concise and lightweight stringjs.com.
Stars: ✭ 15 (-55.88%)
simplematch: Minimal, super readable string pattern matching for Python.
Stars: ✭ 147 (+332.35%)
vmo: Python Modules of Variable Markov Oracle
Stars: ✭ 23 (-32.35%)
Quickenshtein: Making the quickest and most memory efficient implementation of Levenshtein Distance with SIMD and Threading support
Stars: ✭ 204 (+500%)
vbml: Way to check, match and resist.
Stars: ✭ 27 (-20.59%)
eddie: No description or website provided.
Stars: ✭ 18 (-47.06%)
ATGValidator: iOS validation framework with form validation support
Stars: ✭ 51 (+50%)
affinegap: 📐 A Cython implementation of the affine gap string distance
Stars: ✭ 57 (+67.65%)
PFAC: PFAC is an open library for exact string matching performed on NVIDIA GPUs
Stars: ✭ 41 (+20.59%)
Java String Similarity: Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-Winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
Stars: ✭ 2,403 (+6967.65%)
Diff Match Patch: Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.
Stars: ✭ 4,910 (+14341.18%)
Differencekit: 💻 A fast and flexible O(n) difference algorithm framework for Swift collection.
Stars: ✭ 2,986 (+8682.35%)
unikmer: Toolkit for k-mer with taxonomic information
Stars: ✭ 46 (+35.29%)
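Toolgood.words: A high-performance component for detecting and filtering sensitive words (illegal words and profanity), with conversion between Traditional and Simplified Chinese, conversion between full-width and half-width characters, Chinese character to Pinyin conversion, fuzzy search, and other features.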
Stars: ✭ 2,785 (+8091.18%)
php aho corasick: Aho-Corasick string search algorithm PHP extension implementation.
Stars: ✭ 45 (+32.35%)
multi string replace: A fast multiple string replace library for Ruby. Uses a C implementation of the Aho-Corasick algorithm based on https://github.com/morenice/ahocorasick while adding support for on-the-fly multiple string replacement. A faster alternative to String.gsub when dealing with non-regex (exact match) use cases.
Stars: ✭ 16 (-52.94%)
stringbench: String matching algorithm benchmark
Stars: ✭ 31 (-8.82%)
String Similarity: Finds the degree of similarity between two strings, based on Dice's Coefficient, which is mostly better than Levenshtein distance.
Stars: ✭ 2,254 (+6529.41%)
UMICollapse: Accelerating the deduplication and collapsing process for reads with Unique Molecular Identifiers (UMI). Heavily optimized for scalability and orders of magnitude faster than a previous tool.
Stars: ✭ 31 (-8.82%)