All Projects → mlampros → fuzzywuzzyR

mlampros / fuzzywuzzyR

Licence: other
fuzzy string matching in R

Programming Languages

r
7636 projects

Projects that are alternatives of or similar to fuzzywuzzyR

Competitive Programming Repository
Competitive Programming templates that I used during the past few years.
Stars: ✭ 367 (+1046.88%)
Mutual labels:  string, matching
aho-corasick-node
A Node implementation of the Aho-Corasick string matching algorithm based on DoubleArray Trie.
Stars: ✭ 16 (-50%)
Mutual labels:  string, matching
as-string-sink
An efficient dynamically sized string buffer (aka String Builder) for AssemblyScript
Stars: ✭ 23 (-28.12%)
Mutual labels:  string
snowman
Welcome to Snowman App – a Data Matching Benchmark Platform.
Stars: ✭ 25 (-21.87%)
Mutual labels:  matching
speedy-net
Speedy Net, Speedy Match and Speedy Composer
Stars: ✭ 16 (-50%)
Mutual labels:  matching
FAIR.m
Flexible Algorithms for Image Registration
Stars: ✭ 103 (+221.88%)
Mutual labels:  matching
hydrate-text
A small, dependency-free and strongly typed template engine.
Stars: ✭ 45 (+40.63%)
Mutual labels:  string
alter-nlu
Natural language understanding library for chatbots with intent recognition and entity extraction.
Stars: ✭ 45 (+40.63%)
Mutual labels:  fuzzywuzzy
Stringfication
🔨 Make all objects to String!
Stars: ✭ 33 (+3.13%)
Mutual labels:  string
strutil
Golang metrics for calculating string similarity and other string utility functions
Stars: ✭ 114 (+256.25%)
Mutual labels:  string
stringy
Convert string to camel case, snake case, kebab case / slugify, custom delimiter, pad string, tease string and many other functionalities with help of by Stringy package.
Stars: ✭ 137 (+328.13%)
Mutual labels:  string
AgileStringDecryptor
a dynamic Agile.NET string decryptor that relies on invoke by wwh1004 | Version : 6.X
Stars: ✭ 24 (-25%)
Mutual labels:  string
goin
`in` operator for go
Stars: ✭ 17 (-46.87%)
Mutual labels:  string
string-similarity-js
Lightweight string similarity function for javascript
Stars: ✭ 29 (-9.37%)
Mutual labels:  string
DigitText
The module allows to translate numbers into a text equivalent. This is important in the billing.
Stars: ✭ 22 (-31.25%)
Mutual labels:  string
is-string
Is this value a JS String object or primitive? This module works cross-realm/iframe, and despite ES6 @@toStringTag.
Stars: ✭ 17 (-46.87%)
Mutual labels:  string
jest-serializer-html-string
A better Jest snapshot serializer for plain html strings
Stars: ✭ 17 (-46.87%)
Mutual labels:  string
compact str
A memory efficient string type that can store up to 24* bytes on the stack
Stars: ✭ 322 (+906.25%)
Mutual labels:  string
rMIDAS
R package for missing-data imputation with deep learning
Stars: ✭ 20 (-37.5%)
Mutual labels:  reticulate
str
str is a string module with useful methods that don't exist in Lua's core
Stars: ✭ 30 (-6.25%)
Mutual labels:  string

tic codecov.io CRAN_Status_Badge Downloads Buy Me A Coffee Dependencies

fuzzywuzzyR


The fuzzywuzzyR package is a fuzzy string matching implementation of the fuzzywuzzy python package. It uses the Levenshtein Distance to calculate the differences between sequences. More details on the functionality of fuzzywuzzyR can be found in the blog-post and in the package Vignette.


UPDATE 26-07-2018: A Singularity image file is available in case that someone intends to run fuzzywuzzyR on Ubuntu Linux (locally or in a cloud instance) with all package requirements pre-installed. This allows the user to utilize the fuzzywuzzyR package without having to spend time on the installation process.


System Requirements


  • Python (>= 2.4)

  • difflib

  • fuzzywuzzy ( >=0.15.0 )

  • python-Levenshtein ( >=0.12.0, optional, provides a 4-10x speedup in String Matching, though may result in differing results for certain cases)


Before the installation of any python modules one should check the python-configuration using :


reticulate::py_config()

All modules should be installed in the default python configuration (the configuration that the R-session displays as default), otherwise errors will occur during package installation.


Debian/Ubuntu/Fedora


Python2

sudo apt-get install python-pip
sudo pip install --upgrade pip
pip install fuzzywuzzy
pip install python-Levenshtein

Python 3

sudo apt-get install python3-pip
sudo pip3 install --upgrade pip
pip3 install fuzzywuzzy
pip3 install python-Levenshtein



Macintosh OSX


sudo easy_install pip
sudo pip install fuzzywuzzy
sudo pip install python-Levenshtein

Windows OS


  • Download of get-pip.py
  • Update of the Environment variables ( Control Panel >> System and Security >> System >> Advanced system settings >> Environment variables >> System variables >> Path >> Edit ) by adding ( for instance in case of python 2.7 ) :
C:\Python27;C:\Python27\Scripts
pip install fuzzywuzzy
pip install python-Levenshtein

Installation of the fuzzywuzzyR package


To install the package from CRAN use,

install.packages('fuzzywuzzyR')


and to download the latest version from Github use the install_github function of the devtools package,

devtools::install_github(repo = 'mlampros/fuzzywuzzyR')

Use the following link to report bugs/issues,

https://github.com/mlampros/fuzzywuzzyR/issues


Citation:

If you use the code of this repository in your paper or research please cite both fuzzywuzzyR and the original software https://CRAN.R-project.org/package=fuzzywuzzyR/citation.html:


@Manual{,
  title = {{fuzzywuzzyR}: Fuzzy String Matching in R},
  author = {Lampros Mouselimis},
  year = {2021},
  note = {R package version 1.0.5},
  url = {https://CRAN.R-project.org/package=fuzzywuzzyR},
}

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].