strutilGolang metrics for calculating string similarity and other string utility functions
Stars: ✭ 114 (+90%)
bedaBeda is a golang library for detecting how similar a two string
Stars: ✭ 34 (-43.33%)
stanceLearned string similarity for entity names using optimal transport.
Stars: ✭ 27 (-55%)
fuzzy-matchLibrary and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.
Stars: ✭ 31 (-48.33%)
fish-fzyfzy inegration with fish. Search history, navigate directories and more. Blazingly fast.
Stars: ✭ 18 (-70%)
FaintExtensible TUI fuzzy file file explorer
Stars: ✭ 82 (+36.67%)
fuzzychineseA small package to fuzzy match chinese words
Stars: ✭ 50 (-16.67%)
Fuse SwiftA lightweight fuzzy-search library, with zero dependencies
Stars: ✭ 767 (+1178.33%)
FuzzymatcherRecord linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4
Stars: ✭ 173 (+188.33%)
FuzzywuzzyJava fuzzy string matching implementation of the well known Python's fuzzywuzzy algorithm. Fuzzy search for Java
Stars: ✭ 506 (+743.33%)
QuickenshteinMaking the quickest and most memory efficient implementation of Levenshtein Distance with SIMD and Threading support
Stars: ✭ 204 (+240%)
fuzzy-matcherFuzzy Matching Library for Rust
Stars: ✭ 140 (+133.33%)
AbydosAbydos NLP/IR library for Python
Stars: ✭ 91 (+51.67%)
stringdistanceA fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more..
Stars: ✭ 60 (+0%)
TntsearchA fully featured full text search engine written in PHP
Stars: ✭ 2,693 (+4388.33%)
Java String SimilarityImplementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
Stars: ✭ 2,403 (+3905%)
stringosimString similarity functions, String distance's, Jaccard, Levenshtein, Hamming, Jaro-Winkler, Q-grams, N-grams, LCS - Longest Common Subsequence, Cosine similarity...
Stars: ✭ 47 (-21.67%)
alter-nluNatural language understanding library for chatbots with intent recognition and entity extraction.
Stars: ✭ 45 (-25%)
SymspellpyPython port of SymSpell
Stars: ✭ 420 (+600%)
php aho corasickAho-Corasick string search algorithm PHP extension implementation.
Stars: ✭ 45 (-25%)
multi string replaceA fast multiple string replace library for ruby. Uses a C implementation of the Aho–Corasick Algorithm based on https://github.com/morenice/ahocorasick while adding support for on the fly multiple string replacement. Faster alternative to String.gsub when dealing with non-regex (exact match) use cases
Stars: ✭ 16 (-73.33%)
Liquidmetal💦🤘 A mimetic poly-alloy of the Quicksilver scoring algorithm, essentially LiquidMetal. </Schwarzenegger Voice>
Stars: ✭ 279 (+365%)
fuzzy-searchA collection of algorithms for fuzzy search like in Sublime Text.
Stars: ✭ 49 (-18.33%)
FastenshteinThe fastest .Net Levenshtein around
Stars: ✭ 115 (+91.67%)
bolt.nvim⚡ Ultrafast multi-pane file manager for Neovim with fuzzy matching
Stars: ✭ 100 (+66.67%)
Fuzzball.jsEasy to use and powerful fuzzy string matching, port of fuzzywuzzy.
Stars: ✭ 225 (+275%)
RefinrCluster and merge similar char values: an R implementation of Open Refine clustering algorithms
Stars: ✭ 91 (+51.67%)
zinggScalable identity resolution, entity resolution, data mastering and deduplication using ML
Stars: ✭ 655 (+991.67%)
spaczzFuzzy matching and more functionality for spaCy.
Stars: ✭ 215 (+258.33%)
Yoyo-leafYoyo-leaf is an awesome command-line fuzzy finder.
Stars: ✭ 49 (-18.33%)
GitgotSemi-automated, feedback-driven tool to rapidly search through troves of public data on GitHub for sensitive secrets.
Stars: ✭ 964 (+1506.67%)
strsimstring similarity based on Dice's coefficient in go
Stars: ✭ 39 (-35%)
LevenshteinThe Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity
Stars: ✭ 38 (-36.67%)
seqalignCollection of sequence alignment algorithms.
Stars: ✭ 20 (-66.67%)
TalismanStraightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Stars: ✭ 584 (+873.33%)
affinegap📐 A Cython implementation of the affine gap string distance
Stars: ✭ 57 (-5%)
fuzzywuzzyRfuzzy string matching in R
Stars: ✭ 32 (-46.67%)
Persian ToolsAn anthology of a variety of tools for the Persian language in javascript
Stars: ✭ 458 (+663.33%)
Toolgood.words一款高性能敏感词(非法词/脏字)检测过滤组件,附带繁体简体互换,支持全角半角互换,汉字转拼音,模糊搜索等功能。
Stars: ✭ 2,785 (+4541.67%)
FuzzysearchFind parts of long text or data, allowing for some changes/typos.
Stars: ✭ 157 (+161.67%)
ClosestmatchGolang library for fuzzy matching within a set of strings 📃
Stars: ✭ 353 (+488.33%)
stringbenchString matching algorithm benchmark
Stars: ✭ 31 (-48.33%)
effceeEffcee is a C++ library for stateful pattern matching of strings, inspired by LLVM's FileCheck
Stars: ✭ 76 (+26.67%)
TeamReferenceTeam reference for Competitive Programming. Algorithms implementations very used in the ACM-ICPC contests. Latex template to build your own team reference.
Stars: ✭ 29 (-51.67%)
Re FlexThe regex-centric, fast lexical analyzer generator for C++ with full Unicode support. Faster than Flex. Accepts Flex specifications. Generates reusable source code that is easy to understand. Introduces indent/dedent anchors, lazy quantifiers, functions for lex/syntax error reporting, and more. Seamlessly integrates with Bison and other parsers.
Stars: ✭ 274 (+356.67%)
wildmatchSimple string matching with questionmark- and star-wildcard operator
Stars: ✭ 37 (-38.33%)
SymspellSymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Stars: ✭ 1,976 (+3193.33%)
SymSpellCppPyFast SymSpell written in c++ and exposes to python via pybind11
Stars: ✭ 28 (-53.33%)
ATGValidatoriOS validation framework with form validation support
Stars: ✭ 51 (-15%)
PFACPFAC is an open library for exact string matching performed on NVIDIA GPUs
Stars: ✭ 41 (-31.67%)
LeaderfAn efficient fuzzy finder that helps to locate files, buffers, mrus, gtags, etc. on the fly for both vim and neovim.
Stars: ✭ 1,733 (+2788.33%)
splinkImplementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters
Stars: ✭ 181 (+201.67%)