ropensci / hunspell

Licence: other

High-Performance Stemmer, Tokenizer, and Spell Checker for R

Programming Languages

C++

36643 projects - #6 most used programming language

7636 projects

Projects that are alternatives of or similar to hunspell

spacy hunspell

✏️ Hunspell extension for spaCy 2.0.

Stars: ✭ 94 (-6.93%)

Mutual labels: spell-check, hunspell, spellchecker

hunspell-asm

WebAssembly based Javascript bindings for hunspell spellchecker

Stars: ✭ 60 (-40.59%)

Mutual labels: hunspell, spellchecker

Tokenizers

Fast, Consistent Tokenization of Natural Language Text

Stars: ✭ 161 (+59.41%)

Mutual labels: tokenizer, r-package

spellchecker-wasm

SpellcheckerWasm is an extrememly fast spellchecker for WebAssembly based on SymSpell

Stars: ✭ 46 (-54.46%)

Mutual labels: spell-check, spellchecker

Awesome Persian Nlp Ir

Curated List of Persian Natural Language Processing and Information Retrieval Tools and Resources

Stars: ✭ 460 (+355.45%)

Mutual labels: spell-check, stemmer

Spelling

Tools for Spell Checking in R

Stars: ✭ 82 (-18.81%)

Mutual labels: spell-check, r-package

Nlp Js Tools French

POS Tagger, lemmatizer and stemmer for french language in javascript

Stars: ✭ 32 (-68.32%)

Mutual labels: tokenizer, stemmer

Udpipe

R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit

Stars: ✭ 160 (+58.42%)

Mutual labels: tokenizer, r-package

WordSegmentationDP

Word Segmentation with Dynamic Programming

Stars: ✭ 18 (-82.18%)

Mutual labels: spell-check, spellchecker

ctrdata

Aggregate and analyse information on clinical trials from public registers

Stars: ✭ 26 (-74.26%)

Mutual labels: r-package

gd-tokenizer

A small godot project with a tokenizer written in GDScript.

Stars: ✭ 34 (-66.34%)

Mutual labels: tokenizer

covidestim

Bayesian nowcasting with adjustment for delayed and incomplete reporting to estimate COVID-19 infections in the United States

Stars: ✭ 20 (-80.2%)

Mutual labels: r-package

musemeta

R Client for scraping museum metadata

Stars: ✭ 24 (-76.24%)

Mutual labels: r-package

medrxivr

Access and search medRxiv and bioRxiv preprint data

Stars: ✭ 34 (-66.34%)

Mutual labels: r-package

rcites

📦 R package to access the CITES Speciesplus database

Stars: ✭ 12 (-88.12%)

Mutual labels: r-package

lara-hungarian-nlp

NLP class for rapid ChatBot development in Hungarian language

Stars: ✭ 27 (-73.27%)

Mutual labels: stemmer

traits

R package for accessing species trait data from multiple databases

Stars: ✭ 38 (-62.38%)

Mutual labels: r-package

ggimg

ggimg: Graphics Layers for Plotting Image Data with ggplot2

Stars: ✭ 51 (-49.5%)

Mutual labels: r-package

gdoc

⛔ ARCHIVED ⛔ An R Markdown Template for Google Docs

Stars: ✭ 30 (-70.3%)

Mutual labels: r-package

SwiLex

A universal lexer library in Swift.

Stars: ✭ 29 (-71.29%)

Mutual labels: tokenizer

View All Similar Projects ➔

rOpenSci: The hunspell package

High-Performance Stemmer, Tokenizer, and Spell Checker for R

Low level spell checker and morphological analyzer based on the famous hunspell library https://hunspell.github.io. The package can analyze or check individual words as well as tokenize text, latex, html or xml documents. For a more user-friendly interface use the 'spelling' package which builds on this package with utilities to automate checking of files, documentation and vignettes in all common formats.

Installation

This package includes a bundled version of libhunspell and no longer depends on external system libraries:

install.packages("hunspell")

Documentation

About the R package:

Blog post: Hunspell: Spell Checker and Text Parser for R
Blog post: Stemming and Spell Checking in R

Hello World

# Check individual words
words <- c("beer", "wiskey", "wine")
correct <- hunspell_check(words)
print(correct)

# Find suggestions for incorrect words
hunspell_suggest(words[!correct])

# Extract incorrect from a piece of text
bad <- hunspell("spell checkers are not neccessairy for langauge ninja's")
print(bad[[1]])
hunspell_suggest(bad[[1]])

# Stemming
words <- c("love", "loving", "lovingly", "loved", "lover", "lovely", "love")
hunspell_stem(words)
hunspell_analyze(words)

The spelling package uses this package to spell R package documentation:

# Spell check a package
library(spelling)
spell_check_package("~/mypackage")

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

ropensci / hunspell

Programming Languages

Labels

Projects that are alternatives of or similar to hunspell

rOpenSci: The hunspell package

Installation

Documentation

Hello World