spellrSpell check your source code
Stars: ✭ 31 (+29.17%)
Spell-ITFlutter Game for improving your english vocabulary skills
Stars: ✭ 16 (-33.33%)
GreynirCorrectSpelling and grammar correction for Icelandic
Stars: ✭ 12 (-50%)
Api GeneratorPHP-code generator for Laravel framework, with complete support of JSON-API data format
Stars: ✭ 244 (+916.67%)
HallelujahimhallelujahIM(哈利路亚 英文输入法) is an intelligent English input method with auto-suggestions and spell check features, Mac only.
Stars: ✭ 1,334 (+5458.33%)
TextidoteSpelling, grammar and style checking on LaTeX documents
Stars: ✭ 483 (+1912.5%)
Awesome Persian Nlp IrCurated List of Persian Natural Language Processing and Information Retrieval Tools and Resources
Stars: ✭ 460 (+1816.67%)
FuzzySpell checking and fuzzy search suggestion written in Go
Stars: ✭ 290 (+1108.33%)
vim-hugo-helperA small Vim plugin with a set of helpers for Hugo https://gohugo.io
Stars: ✭ 82 (+241.67%)
hunspellHigh-Performance Stemmer, Tokenizer, and Spell Checker for R
Stars: ✭ 101 (+320.83%)
MonpaMONPA 罔拍是一個提供正體中文斷詞、詞性標註以及命名實體辨識的多任務模型
Stars: ✭ 203 (+745.83%)
Lac百度NLP:分词,词性标注,命名实体识别,词重要性
Stars: ✭ 2,792 (+11533.33%)
PycantoneseCantonese Linguistics and NLP in Python
Stars: ✭ 147 (+512.5%)
KiwiKiwi(지능형 한국어 형태소 분석기)
Stars: ✭ 107 (+345.83%)
ToiroA comparison tool of Japanese tokenizers
Stars: ✭ 95 (+295.83%)
CwsSource code for an ACL2016 paper of Chinese word segmentation
Stars: ✭ 81 (+237.5%)
Han Segment基于隐式马尔可夫模型和正向最大化匹配的中文分词系统
Stars: ✭ 17 (-29.17%)
YoutokentomeUnsupervised text tokenizer focused on computational efficiency
Stars: ✭ 728 (+2933.33%)
PythainlpThai Natural Language Processing in Python.
Stars: ✭ 582 (+2325%)
SentencepieceUnsupervised text tokenizer for Neural Network-based text generation.
Stars: ✭ 5,540 (+22983.33%)
EkphrasisEkphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
Stars: ✭ 433 (+1704.17%)
VncorenlpA Vietnamese natural language processing toolkit (NAACL 2018)
Stars: ✭ 354 (+1375%)
NagisaA Japanese tokenizer based on recurrent neural networks
Stars: ✭ 260 (+983.33%)
JumanppJuman++ (a Morphological Analyzer Toolkit)
Stars: ✭ 254 (+958.33%)
hashformersHashformers is a framework for hashtag segmentation with transformers.
Stars: ✭ 18 (-25%)
youtokentome-rubyHigh performance unsupervised text tokenization for Ruby
Stars: ✭ 17 (-29.17%)
UETsegmenterA toolkit for Vietnamese word segmentation
Stars: ✭ 60 (+150%)
hanzi-toolsConverts from Chinese characters to pinyin, between simplified and traditional, and does word segmentation.
Stars: ✭ 69 (+187.5%)
Pytorch-NLUPytorch-NLU,一个中文文本分类、序列标注工具包,支持中文长文本、短文本的多类、多标签分类任务,支持中文命名实体识别、词性标注、分词等序列标注任务。 Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of speech ta…
Stars: ✭ 151 (+529.17%)
dnn-lstm-word-segmentChinese Word Segmention Base on the Deep Learning and LSTM Neural Network
Stars: ✭ 24 (+0%)
codeprepA toolkit for pre-processing large source code corpora
Stars: ✭ 39 (+62.5%)
sylbreakSyllable segmentation tool for Myanmar language (Burmese) by Ye.
Stars: ✭ 44 (+83.33%)
sentencepiece-jniJava JNI wrapper for SentencePiece: unsupervised text tokenizer for Neural Network-based text generation.
Stars: ✭ 26 (+8.33%)
sktSanskrit compound segmentation using seq2seq model
Stars: ✭ 21 (-12.5%)
ckipnlpCKIP CoreNLP Toolkits
Stars: ✭ 92 (+283.33%)
sentencepieceR package for Byte Pair Encoding / Unigram modelling based on Sentencepiece
Stars: ✭ 22 (-8.33%)
CorrectLyCorrectLy - Open Source Spelling & Grammar correction
Stars: ✭ 23 (-4.17%)
sqlite-spellfixLoadable spellfix1 extension for sqlite as python package
Stars: ✭ 13 (-45.83%)
grammarifyGrammarify is a npm package that safely cleans up text that has mispellings, improper capitalization, lexical illusions, among other things.
Stars: ✭ 43 (+79.17%)
deep-spell-checkrKeras implementation of character-level sequence-to-sequence learning for spelling correction
Stars: ✭ 65 (+170.83%)
CatalystAccelerated deep learning R&D
Stars: ✭ 2,804 (+11583.33%)