jamesturk / Jellyfish

Licence: bsd-2-clause
🎐 A Python library for approximate and phonetic matching of strings.

Projects that are alternatives of or similar to Jellyfish

eddie
No description or website provided.
Stars: ✭ 18 (-98.85%)
Mutual labels:  levenshtein, jaro-winkler, hamming
stringdistance
A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more.
Stars: ✭ 60 (-96.18%)
Mutual labels:  levenshtein, jaro-winkler, soundex
double-metaphone
Fast Double Metaphone algorithm
Stars: ✭ 70 (-95.54%)
Mutual labels:  soundex, metaphone
ceja
PySpark phonetic and string matching algorithms
Stars: ✭ 24 (-98.47%)
Mutual labels:  jaro-winkler, metaphone
edits.cr
Edit distance algorithms inc. Jaro, Damerau-Levenshtein, and Optimal Alignment
Stars: ✭ 16 (-98.98%)
Mutual labels:  levenshtein, jaro-winkler
Fuzzball.js
Easy to use and powerful fuzzy string matching, port of fuzzywuzzy.
Stars: ✭ 225 (-85.68%)
Mutual labels:  fuzzy-search, levenshtein
levenshtein.c
Levenshtein algorithm in C
Stars: ✭ 77 (-95.1%)
Mutual labels:  fuzzy-search, levenshtein
simetric
String similarity metrics for Elixir
Stars: ✭ 59 (-96.24%)
Mutual labels:  levenshtein, jaro-winkler
phonix
Phonetic library for .NET
Stars: ✭ 63 (-95.99%)
Mutual labels:  soundex, metaphone
strutil
Golang metrics for calculating string similarity and other string utility functions
Stars: ✭ 114 (-92.74%)
Mutual labels:  levenshtein, jaro-winkler
Symspell
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Stars: ✭ 1,976 (+25.78%)
Mutual labels:  fuzzy-search, levenshtein
Symspellpy
Python port of SymSpell
Stars: ✭ 420 (-73.27%)
Mutual labels:  fuzzy-search, levenshtein
LinSpell
Fast approximate strings search & spelling correction
Stars: ✭ 52 (-96.69%)
Mutual labels:  fuzzy-search, levenshtein
phonetic-algorithms
Phonetic-Algorithms for fuzzy searching | PHP
Stars: ✭ 14 (-99.11%)
Mutual labels:  fuzzy-search, metaphone
stringosim
String similarity functions and string distances: Jaccard, Levenshtein, Hamming, Jaro-Winkler, Q-grams, N-grams, LCS (Longest Common Subsequence), Cosine similarity...
Stars: ✭ 47 (-97.01%)
Mutual labels:  levenshtein, jaro-winkler
Symspellcompound
SymSpellCompound: compound aware automatic spelling correction
Stars: ✭ 61 (-96.12%)
Mutual labels:  fuzzy-search, levenshtein
Node Damerau Levenshtein
Damerau-Levenshtein distance function for Node
Stars: ✭ 27 (-98.28%)
Mutual labels:  levenshtein
Str metrics
Ruby gem (native extension in Rust) providing implementations of various string metrics
Stars: ✭ 68 (-95.67%)
Mutual labels:  levenshtein
Flexsearch
Next-Generation full text search library for Browser and Node.js
Stars: ✭ 8,108 (+416.1%)
Mutual labels:  fuzzy-search
Inquirer Checkbox Plus Prompt
Checkbox with autocomplete and other additions for Inquirer
Stars: ✭ 25 (-98.41%)
Mutual labels:  fuzzy-search

Overview

jellyfish is a library for approximate & phonetic matching of strings.

Source: https://github.com/jamesturk/jellyfish

Documentation: https://jamesturk.github.io/jellyfish/

Issues: https://github.com/jamesturk/jellyfish/issues

Included Algorithms

String comparison:

  • Levenshtein Distance
  • Damerau-Levenshtein Distance
  • Jaro Distance
  • Jaro-Winkler Distance
  • Match Rating Approach Comparison
  • Hamming Distance
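
To make two of these measures concrete, here is a minimal pure-Python sketch of Levenshtein and Hamming distance. It is an illustration only, not jellyfish's implementation; the function names `levenshtein` and `hamming` are hypothetical, and counting extra trailing characters as differences in `hamming` is an assumption made so that strings of unequal length still get a score.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions, or
    substitutions needed to turn a into b."""
    # previous[j] is the distance between the prefix of a processed so far
    # (initially empty) and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning the first i chars of a into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute ca with cb (free on a match)
            ))
        previous = current
    return previous[-1]


def hamming(a: str, b: str) -> int:
    """Number of positions at which a and b differ.

    Assumption of this sketch: characters beyond the end of the shorter
    string each count as one difference.
    """
    mismatches = sum(ca != cb for ca, cb in zip(a, b))
    return mismatches + abs(len(a) - len(b))
```

With this sketch, `levenshtein('jellyfish', 'smellyfish')` is 2 (one substitution, one insertion), while `levenshtein('jellyfish', 'jellyfihs')` is also 2 because plain Levenshtein has no transposition step. Damerau-Levenshtein adds that step, so a swapped pair of adjacent characters costs 1.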

Phonetic encoding:

  • American Soundex
  • Metaphone
  • NYSIIS (New York State Identification and Intelligence System)
  • Match Rating Codex
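
As an illustration of what a phonetic encoding does, the following is a minimal sketch of the commonly published American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent equal codes (H and W do not break a run, vowels do), and pad or cut to four characters. It is not jellyfish's own code; the name `soundex_sketch` is hypothetical, and the sketch assumes plain ASCII names.

```python
# Digit groups of American Soundex: letters in the same group sound alike.
_CODES = {}
for digit, letters in enumerate(("BFPV", "CGJKQSXZ", "DT", "L", "MN", "R"), start=1):
    for letter in letters:
        _CODES[letter] = str(digit)


def soundex_sketch(name: str) -> str:
    """Return a four-character Soundex-style code, or '' for empty input."""
    letters = [ch for ch in name.upper() if ch.isalpha()]
    if not letters:
        return ""
    result = letters[0]
    # The first letter's own code takes part in the duplicate check.
    last = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W are skipped and do not separate equal codes
        code = _CODES.get(ch, "")  # vowels (and Y) have no code
        if code and code != last:
            result += code
        last = code  # a vowel resets the run, so a repeated code counts again
        if len(result) == 4:
            break
    return result.ljust(4, "0")
```

Under these rules `soundex_sketch('Robert')` and `soundex_sketch('Rupert')` both give 'R163', which is the point of the encoding: names that sound alike share a key.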

Example Usage

>>> import jellyfish
>>> jellyfish.levenshtein_distance('jellyfish', 'smellyfish')
2
>>> # older releases expose this function as jaro_distance
>>> jellyfish.jaro_similarity('jellyfish', 'smellyfish')
0.89629629629629637
>>> jellyfish.damerau_levenshtein_distance('jellyfish', 'jellyfihs')
1

>>> jellyfish.metaphone('Jellyfish')
'JLFX'
>>> jellyfish.soundex('Jellyfish')
'J412'
>>> jellyfish.nysiis('Jellyfish')
'JALYF'
>>> jellyfish.match_rating_codex('Jellyfish')
'JLLFSH'
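
The Jaro score of roughly 0.896 in the session above can be reproduced by hand. The sketch below follows the textbook definition of Jaro similarity (matching window, match count m, transposition count t, then the mean of m/|a|, m/|b| and (m - t)/m). It is an assumption-laden illustration rather than the library's code, and the name `jaro_sketch` is hypothetical.

```python
def jaro_sketch(a: str, b: str) -> float:
    """Jaro similarity in [0, 1]; 1.0 means identical strings."""
    if a == b:
        return 1.0
    if not a or not b:
        return 0.0
    # Two equal characters only match if their positions are this close.
    window = max(max(len(a), len(b)) // 2 - 1, 0)
    a_matched = [False] * len(a)
    b_matched = [False] * len(b)
    matches = 0
    for i, ca in enumerate(a):
        lo = max(0, i - window)
        hi = min(len(b), i + window + 1)
        for j in range(lo, hi):
            if not b_matched[j] and b[j] == ca:
                a_matched[i] = b_matched[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0
    # Matched characters in order of appearance in each string.
    a_seq = [c for c, m in zip(a, a_matched) if m]
    b_seq = [c for c, m in zip(b, b_matched) if m]
    # Each out-of-order pair is counted from both sides, hence the halving.
    transpositions = sum(x != y for x, y in zip(a_seq, b_seq)) // 2
    return (
        matches / len(a)
        + matches / len(b)
        + (matches - transpositions) / matches
    ) / 3
```

For 'jellyfish' and 'smellyfish' the eight characters of 'ellyfish' match in order with no transpositions, so the score is (8/9 + 8/10 + 8/8) / 3, about 0.896.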