All Projects → ropensci → hunspell

ropensci / hunspell

Licence: other
High-Performance Stemmer, Tokenizer, and Spell Checker for R

Programming Languages

C++
36643 projects - #6 most used programming language
r
7636 projects

Projects that are alternatives of or similar to hunspell

spacy hunspell
✏️ Hunspell extension for spaCy 2.0.
Stars: ✭ 94 (-6.93%)
Mutual labels:  spell-check, hunspell, spellchecker
hunspell-asm
WebAssembly based Javascript bindings for hunspell spellchecker
Stars: ✭ 60 (-40.59%)
Mutual labels:  hunspell, spellchecker
Tokenizers
Fast, Consistent Tokenization of Natural Language Text
Stars: ✭ 161 (+59.41%)
Mutual labels:  tokenizer, r-package
spellchecker-wasm
SpellcheckerWasm is an extrememly fast spellchecker for WebAssembly based on SymSpell
Stars: ✭ 46 (-54.46%)
Mutual labels:  spell-check, spellchecker
Awesome Persian Nlp Ir
Curated List of Persian Natural Language Processing and Information Retrieval Tools and Resources
Stars: ✭ 460 (+355.45%)
Mutual labels:  spell-check, stemmer
Spelling
Tools for Spell Checking in R
Stars: ✭ 82 (-18.81%)
Mutual labels:  spell-check, r-package
Nlp Js Tools French
POS Tagger, lemmatizer and stemmer for french language in javascript
Stars: ✭ 32 (-68.32%)
Mutual labels:  tokenizer, stemmer
Udpipe
R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
Stars: ✭ 160 (+58.42%)
Mutual labels:  tokenizer, r-package
WordSegmentationDP
Word Segmentation with Dynamic Programming
Stars: ✭ 18 (-82.18%)
Mutual labels:  spell-check, spellchecker
ctrdata
Aggregate and analyse information on clinical trials from public registers
Stars: ✭ 26 (-74.26%)
Mutual labels:  r-package
gd-tokenizer
A small godot project with a tokenizer written in GDScript.
Stars: ✭ 34 (-66.34%)
Mutual labels:  tokenizer
covidestim
Bayesian nowcasting with adjustment for delayed and incomplete reporting to estimate COVID-19 infections in the United States
Stars: ✭ 20 (-80.2%)
Mutual labels:  r-package
musemeta
R Client for scraping museum metadata
Stars: ✭ 24 (-76.24%)
Mutual labels:  r-package
medrxivr
Access and search medRxiv and bioRxiv preprint data
Stars: ✭ 34 (-66.34%)
Mutual labels:  r-package
rcites
📦 R package to access the CITES Speciesplus database
Stars: ✭ 12 (-88.12%)
Mutual labels:  r-package
lara-hungarian-nlp
NLP class for rapid ChatBot development in Hungarian language
Stars: ✭ 27 (-73.27%)
Mutual labels:  stemmer
traits
R package for accessing species trait data from multiple databases
Stars: ✭ 38 (-62.38%)
Mutual labels:  r-package
ggimg
ggimg: Graphics Layers for Plotting Image Data with ggplot2
Stars: ✭ 51 (-49.5%)
Mutual labels:  r-package
gdoc
⛔ ARCHIVED ⛔ An R Markdown Template for Google Docs
Stars: ✭ 30 (-70.3%)
Mutual labels:  r-package
SwiLex
A universal lexer library in Swift.
Stars: ✭ 29 (-71.29%)
Mutual labels:  tokenizer

rOpenSci: The hunspell package

High-Performance Stemmer, Tokenizer, and Spell Checker for R

Project Status: Active – The project has reached a stable, usable state and is being actively developed. CRAN_Status_Badge CRAN RStudio mirror downloads

Low level spell checker and morphological analyzer based on the famous hunspell library https://hunspell.github.io. The package can analyze or check individual words as well as tokenize text, latex, html or xml documents. For a more user-friendly interface use the 'spelling' package which builds on this package with utilities to automate checking of files, documentation and vignettes in all common formats.

Installation

This package includes a bundled version of libhunspell and no longer depends on external system libraries:

install.packages("hunspell")

Documentation

About the R package:

Hello World

# Check individual words
words <- c("beer", "wiskey", "wine")
correct <- hunspell_check(words)
print(correct)

# Find suggestions for incorrect words
hunspell_suggest(words[!correct])

# Extract incorrect from a piece of text
bad <- hunspell("spell checkers are not neccessairy for langauge ninja's")
print(bad[[1]])
hunspell_suggest(bad[[1]])

# Stemming
words <- c("love", "loving", "lovingly", "loved", "lover", "lovely", "love")
hunspell_stem(words)
hunspell_analyze(words)

The spelling package uses this package to spell R package documentation:

# Spell check a package
library(spelling)
spell_check_package("~/mypackage")

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].