All Projects → pytorch_Joint-Word-Segmentation-and-POS-Tagging → Similar Projects or Alternatives

81 Open source projects that are alternatives of or similar to pytorch_Joint-Word-Segmentation-and-POS-Tagging

Vncorenlp
A Vietnamese natural language processing toolkit (NAACL 2018)
Stars: ✭ 354 (+856.76%)
Mutual labels:  word-segmentation, pos-tagging
Jumanpp
Juman++ (a Morphological Analyzer Toolkit)
Stars: ✭ 254 (+586.49%)
Mutual labels:  word-segmentation, pos-tagging
SynThai
Thai Word Segmentation and Part-of-Speech Tagging with Deep Learning
Stars: ✭ 41 (+10.81%)
Mutual labels:  word-segmentation, pos-tagging
Nagisa
A Japanese tokenizer based on recurrent neural networks
Stars: ✭ 260 (+602.7%)
Mutual labels:  word-segmentation, pos-tagging
rakutenma-python
Rakuten MA (Python version)
Stars: ✭ 15 (-59.46%)
Mutual labels:  word-segmentation, pos-tagging
Monpa
MONPA 罔拍是一個提供正體中文斷詞、詞性標註以及命名實體辨識的多任務模型
Stars: ✭ 203 (+448.65%)
Mutual labels:  word-segmentation, pos-tagging
Pytorch-NLU
Pytorch-NLU,一个中文文本分类、序列标注工具包,支持中文长文本、短文本的多类、多标签分类任务,支持中文命名实体识别、词性标注、分词等序列标注任务。 Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of speech ta…
Stars: ✭ 151 (+308.11%)
Mutual labels:  word-segmentation, pos-tagging
Symspellpy
Python port of SymSpell
Stars: ✭ 420 (+1035.14%)
Mutual labels:  word-segmentation
word tokenize
Vietnamese Word Tokenize
Stars: ✭ 45 (+21.62%)
Mutual labels:  word-segmentation
Bert Multitask Learning
BERT for Multitask Learning
Stars: ✭ 380 (+927.03%)
Mutual labels:  word-segmentation
youtokentome-ruby
High performance unsupervised text tokenization for Ruby
Stars: ✭ 17 (-54.05%)
Mutual labels:  word-segmentation
Sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
Stars: ✭ 5,540 (+14872.97%)
Mutual labels:  word-segmentation
joineRML
R package for fitting joint models to time-to-event data and multivariate longitudinal data
Stars: ✭ 24 (-35.14%)
Mutual labels:  joint-models
skt
Sanskrit compound segmentation using seq2seq model
Stars: ✭ 21 (-43.24%)
Mutual labels:  word-segmentation
ckipnlp
CKIP CoreNLP Toolkits
Stars: ✭ 92 (+148.65%)
Mutual labels:  word-segmentation
customized-symspell
Java port of SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm
Stars: ✭ 51 (+37.84%)
Mutual labels:  word-segmentation
hanzi-tools
Converts from Chinese characters to pinyin, between simplified and traditional, and does word segmentation.
Stars: ✭ 69 (+86.49%)
Mutual labels:  word-segmentation
cws-tensorflow
基于Tensorflow的中文分词模型
Stars: ✭ 25 (-32.43%)
Mutual labels:  word-segmentation
Paribhasha
paribhasha.herokuapp.com/
Stars: ✭ 21 (-43.24%)
Mutual labels:  pos-tagging
SymSpellCppPy
Fast SymSpell written in c++ and exposes to python via pybind11
Stars: ✭ 28 (-24.32%)
Mutual labels:  word-segmentation
wink-nlp
Developer friendly Natural Language Processing ✨
Stars: ✭ 312 (+743.24%)
Mutual labels:  pos-tagging
dnn-lstm-word-segment
Chinese Word Segmention Base on the Deep Learning and LSTM Neural Network
Stars: ✭ 24 (-35.14%)
Mutual labels:  word-segmentation
Pycantonese
Cantonese Linguistics and NLP in Python
Stars: ✭ 147 (+297.3%)
Mutual labels:  word-segmentation
sylbreak
Syllable segmentation tool for Myanmar language (Burmese) by Ye.
Stars: ✭ 44 (+18.92%)
Mutual labels:  word-segmentation
Malaya
Natural Language Toolkit for bahasa Malaysia, https://malaya.readthedocs.io/
Stars: ✭ 239 (+545.95%)
Mutual labels:  pos-tagging
sentencepiece
R package for Byte Pair Encoding / Unigram modelling based on Sentencepiece
Stars: ✭ 22 (-40.54%)
Mutual labels:  word-segmentation
Kiwi
Kiwi(지능형 한국어 형태소 분석기)
Stars: ✭ 107 (+189.19%)
Mutual labels:  word-segmentation
Sudachipy
Python version of Sudachi, a Japanese tokenizer.
Stars: ✭ 207 (+459.46%)
Mutual labels:  pos-tagging
Pythainlp
Thai Natural Language Processing in Python.
Stars: ✭ 582 (+1472.97%)
Mutual labels:  word-segmentation
syntaxnet
Syntaxnet Parsey McParseface wrapper for POS tagging and dependency parsing
Stars: ✭ 77 (+108.11%)
Mutual labels:  pos-tagging
Ekphrasis
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
Stars: ✭ 433 (+1070.27%)
Mutual labels:  word-segmentation
gum
Repository for the Georgetown University Multilayer Corpus (GUM)
Stars: ✭ 71 (+91.89%)
Mutual labels:  pos-tagging
cross-lingual-struct-flow
PyTorch implementation of ACL paper https://arxiv.org/abs/1906.02656
Stars: ✭ 23 (-37.84%)
Mutual labels:  pos-tagging
Udpipe
R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
Stars: ✭ 160 (+332.43%)
Mutual labels:  pos-tagging
hashformers
Hashformers is a framework for hashtag segmentation with transformers.
Stars: ✭ 18 (-51.35%)
Mutual labels:  word-segmentation
sinling
A collection of NLP tools for Sinhalese (සිංහල).
Stars: ✭ 38 (+2.7%)
Mutual labels:  pos-tagging
Cws
Source code for an ACL2016 paper of Chinese word segmentation
Stars: ✭ 81 (+118.92%)
Mutual labels:  word-segmentation
Indonesian Nlp Resources
data resource untuk NLP bahasa indonesia
Stars: ✭ 143 (+286.49%)
Mutual labels:  pos-tagging
sequence labeling tf
Sequence Labeling in Tensorflow
Stars: ✭ 18 (-51.35%)
Mutual labels:  pos-tagging
UETsegmenter
A toolkit for Vietnamese word segmentation
Stars: ✭ 60 (+62.16%)
Mutual labels:  word-segmentation
esapp
An unsupervised Chinese word segmentation tool.
Stars: ✭ 13 (-64.86%)
Mutual labels:  word-segmentation
comparable-text-miner
Comparable documents miner: Arabic-English morphological analysis, text processing, n-gram features extraction, POS tagging, dictionary translation, documents alignment, corpus information, text classification, tf-idf computation, text similarity computation, html documents cleaning
Stars: ✭ 31 (-16.22%)
Mutual labels:  pos-tagging
Sudachidict
A lexicon for Sudachi
Stars: ✭ 127 (+243.24%)
Mutual labels:  pos-tagging
Lac
百度NLP:分词,词性标注,命名实体识别,词重要性
Stars: ✭ 2,792 (+7445.95%)
Mutual labels:  word-segmentation
codeprep
A toolkit for pre-processing large source code corpora
Stars: ✭ 39 (+5.41%)
Mutual labels:  word-segmentation
WordSegmentationDP
Word Segmentation with Dynamic Programming
Stars: ✭ 18 (-51.35%)
Mutual labels:  word-segmentation
Pytorch ner bilstm cnn crf
End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF implement in pyotrch
Stars: ✭ 249 (+572.97%)
Mutual labels:  pos-tagging
Symspell
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Stars: ✭ 1,976 (+5240.54%)
Mutual labels:  word-segmentation
Engtagger
English Part-of-Speech Tagger Library; a Ruby port of Lingua::EN::Tagger
Stars: ✭ 217 (+486.49%)
Mutual labels:  pos-tagging
FISR
Official repository of FISR (AAAI 2020).
Stars: ✭ 72 (+94.59%)
Mutual labels:  joint-models
Vntk
Vietnamese NLP Toolkit for Node
Stars: ✭ 170 (+359.46%)
Mutual labels:  pos-tagging
Toiro
A comparison tool of Japanese tokenizers
Stars: ✭ 95 (+156.76%)
Mutual labels:  word-segmentation
Jptdp
Neural network models for joint POS tagging and dependency parsing (CoNLL 2017-2018)
Stars: ✭ 146 (+294.59%)
Mutual labels:  pos-tagging
TweebankNLP
[LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweebank-NER dataset
Stars: ✭ 84 (+127.03%)
Mutual labels:  pos-tagging
Han Segment
基于隐式马尔可夫模型和正向最大化匹配的中文分词系统
Stars: ✭ 17 (-54.05%)
Mutual labels:  word-segmentation
strollr2d icassp2017
Image Denoising Codes using STROLLR learning, the Matlab implementation of the paper in ICASSP2017
Stars: ✭ 22 (-40.54%)
Mutual labels:  joint-models
sentencepiece-jni
Java JNI wrapper for SentencePiece: unsupervised text tokenizer for Neural Network-based text generation.
Stars: ✭ 26 (-29.73%)
Mutual labels:  word-segmentation
nlp-cheat-sheet-python
NLP Cheat Sheet, Python, spacy, LexNPL, NLTK, tokenization, stemming, sentence detection, named entity recognition
Stars: ✭ 69 (+86.49%)
Mutual labels:  pos-tagging
spell
Spelling correction and string segmentation written in Go
Stars: ✭ 24 (-35.14%)
Mutual labels:  word-segmentation
Youtokentome
Unsupervised text tokenizer focused on computational efficiency
Stars: ✭ 728 (+1867.57%)
Mutual labels:  word-segmentation
1-60 of 81 similar projects