String similarity functions, String distance's, Jaccard, Levenshtein, Hamming, Jaro-Winkler, Q-grams, N-grams, LCS - Longest Common Subsequence, Cosine similarity...

Stars: ✭ 47 (+38.24%)

Mutual labels: string-distance

string-similarity-js

Lightweight string similarity function for javascript

Stars: ✭ 29 (-14.71%)

Mutual labels: string-similarity

TeamReference

Team reference for Competitive Programming. Algorithms implementations very used in the ACM-ICPC contests. Latex template to build your own team reference.

Stars: ✭ 29 (-14.71%)

Mutual labels: string-matching

wildmatch

Simple string matching with questionmark- and star-wildcard operator

Stars: ✭ 37 (+8.82%)

Mutual labels: string-matching

AnyDiff

A CSharp (C#) diff library that allows you to diff two objects and get a list of the differences back.

Stars: ✭ 80 (+135.29%)

Mutual labels: difference

hyperdiff

Find common, removed and added element between two collections.

Stars: ✭ 14 (-58.82%)

Mutual labels: difference

effcee

Effcee is a C++ library for stateful pattern matching of strings, inspired by LLVM's FileCheck

Stars: ✭ 76 (+123.53%)

Mutual labels: string-matching

algos

A collection of algorithms in rust

Stars: ✭ 16 (-52.94%)

Mutual labels: string-matching

fuzzy-match

Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.

Stars: ✭ 31 (-8.82%)

Mutual labels: string-matching

stringdistance

A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more..

Stars: ✭ 60 (+76.47%)

Mutual labels: string-similarity

node-red-contrib-string

Provides a string manipulation node with a chainable UI based on the concise and lightweight stringjs.com.

Stars: ✭ 15 (-55.88%)

Mutual labels: string-matching

simplematch

Minimal, super readable string pattern matching for python.

Stars: ✭ 147 (+332.35%)

Mutual labels: string-matching

vmo

Python Modules of Variable Markov Oracle

Stars: ✭ 23 (-32.35%)

Mutual labels: string-matching

Quickenshtein

Making the quickest and most memory efficient implementation of Levenshtein Distance with SIMD and Threading support

Stars: ✭ 204 (+500%)

Mutual labels: string-distance

speech-recognition-evaluation

Evaluate results from ASR/Speech-to-Text quickly

Stars: ✭ 25 (-26.47%)

Mutual labels: difference

vbml

Way to check, match and resist.

Stars: ✭ 27 (-20.59%)

Mutual labels: string-matching

eddie

No description or website provided.

Stars: ✭ 18 (-47.06%)

Mutual labels: string-similarity

ATGValidator

iOS validation framework with form validation support

Stars: ✭ 51 (+50%)

Mutual labels: string-matching

affinegap

📐 A Cython implementation of the affine gap string distance

Stars: ✭ 57 (+67.65%)

Mutual labels: string-distance

PFAC

PFAC is an open library for exact string matching performed on NVIDIA GPUs

Stars: ✭ 41 (+20.59%)

Mutual labels: string-matching

simd-byte-lookup

SIMDized check which bytes are in a set

Stars: ✭ 23 (-32.35%)

Mutual labels: string-matching

levenshtein.c

Levenshtein algorithm in C

Stars: ✭ 77 (+126.47%)

Mutual labels: string-matching

levenshtein finder

Similar string search in Levenshtein distance

Stars: ✭ 19 (-44.12%)

Mutual labels: string-distance

Java String Similarity

Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...

Stars: ✭ 2,403 (+6967.65%)

Mutual labels: string-distance

Diff Match Patch

Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.

Stars: ✭ 4,910 (+14341.18%)

Mutual labels: difference

Differencekit

💻 A fast and flexible O(n) difference algorithm framework for Swift collection.

Stars: ✭ 2,986 (+8682.35%)

Mutual labels: difference

unikmer

Toolkit for k-mer with taxonomic information

Stars: ✭ 46 (+35.29%)

Mutual labels: difference

Toolgood.words

一款高性能敏感词(非法词/脏字)检测过滤组件，附带繁体简体互换，支持全角半角互换，汉字转拼音，模糊搜索等功能。

Stars: ✭ 2,785 (+8091.18%)

Mutual labels: string-matching

php aho corasick

Aho-Corasick string search algorithm PHP extension implementation.

Stars: ✭ 45 (+32.35%)

Mutual labels: string-matching

node-validator

Node.js string validator for more

Stars: ✭ 14 (-58.82%)

Mutual labels: string-matching

multi string replace

A fast multiple string replace library for ruby. Uses a C implementation of the Aho–Corasick Algorithm based on https://github.com/morenice/ahocorasick while adding support for on the fly multiple string replacement. Faster alternative to String.gsub when dealing with non-regex (exact match) use cases

Stars: ✭ 16 (-52.94%)

Mutual labels: string-matching

stringbench

String matching algorithm benchmark

Stars: ✭ 31 (-8.82%)

Mutual labels: string-matching

String Similarity

Finds degree of similarity between two strings, based on Dice's Coefficient, which is mostly better than Levenshtein distance.

Stars: ✭ 2,254 (+6529.41%)

Mutual labels: string-similarity

UMICollapse

Accelerating the deduplication and collapsing process for reads with Unique Molecular Identifiers (UMI). Heavily optimized for scalability and orders of magnitude faster than a previous tool.

Stars: ✭ 31 (-8.82%)

Mutual labels: string-similarity

1-41 of 41 similar projects