All Projects → pytorch_Joint-Word-Segmentation-and-POS-Tagging → Similar Projects or Alternatives

81 Open source projects that are alternatives of or similar to pytorch_Joint-Word-Segmentation-and-POS-Tagging

A Vietnamese natural language processing toolkit (NAACL 2018)

Stars: ✭ 354 (+856.76%)

Mutual labels: word-segmentation, pos-tagging

Juman++ (a Morphological Analyzer Toolkit)

Stars: ✭ 254 (+586.49%)

Mutual labels: word-segmentation, pos-tagging

Thai Word Segmentation and Part-of-Speech Tagging with Deep Learning

Stars: ✭ 41 (+10.81%)

Mutual labels: word-segmentation, pos-tagging

A Japanese tokenizer based on recurrent neural networks

Stars: ✭ 260 (+602.7%)

Mutual labels: word-segmentation, pos-tagging

rakutenma-python

Rakuten MA (Python version)

Stars: ✭ 15 (-59.46%)

Mutual labels: word-segmentation, pos-tagging

MONPA 罔拍是一個提供正體中文斷詞、詞性標註以及命名實體辨識的多任務模型

Stars: ✭ 203 (+448.65%)

Mutual labels: word-segmentation, pos-tagging

Pytorch-NLU，一个中文文本分类、序列标注工具包，支持中文长文本、短文本的多类、多标签分类任务，支持中文命名实体识别、词性标注、分词等序列标注任务。 Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of speech ta…

Stars: ✭ 151 (+308.11%)

Mutual labels: word-segmentation, pos-tagging

Python port of SymSpell

Stars: ✭ 420 (+1035.14%)

Mutual labels: word-segmentation

Vietnamese Word Tokenize

Stars: ✭ 45 (+21.62%)

Mutual labels: word-segmentation

Bert Multitask Learning

BERT for Multitask Learning

Stars: ✭ 380 (+927.03%)

Mutual labels: word-segmentation

youtokentome-ruby

High performance unsupervised text tokenization for Ruby

Stars: ✭ 17 (-54.05%)

Mutual labels: word-segmentation

Unsupervised text tokenizer for Neural Network-based text generation.

Stars: ✭ 5,540 (+14872.97%)

Mutual labels: word-segmentation

R package for fitting joint models to time-to-event data and multivariate longitudinal data

Stars: ✭ 24 (-35.14%)

Mutual labels: joint-models

Sanskrit compound segmentation using seq2seq model

Stars: ✭ 21 (-43.24%)

Mutual labels: word-segmentation

CKIP CoreNLP Toolkits

Stars: ✭ 92 (+148.65%)

Mutual labels: word-segmentation

customized-symspell

Java port of SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm

Stars: ✭ 51 (+37.84%)

Mutual labels: word-segmentation

Converts from Chinese characters to pinyin, between simplified and traditional, and does word segmentation.

Stars: ✭ 69 (+86.49%)

Mutual labels: word-segmentation

基于Tensorflow的中文分词模型

Stars: ✭ 25 (-32.43%)

Mutual labels: word-segmentation

paribhasha.herokuapp.com/

Stars: ✭ 21 (-43.24%)

Mutual labels: pos-tagging

Fast SymSpell written in c++ and exposes to python via pybind11

Stars: ✭ 28 (-24.32%)

Mutual labels: word-segmentation

Developer friendly Natural Language Processing ✨

Stars: ✭ 312 (+743.24%)

Mutual labels: pos-tagging

dnn-lstm-word-segment

Chinese Word Segmention Base on the Deep Learning and LSTM Neural Network

Stars: ✭ 24 (-35.14%)

Mutual labels: word-segmentation

Cantonese Linguistics and NLP in Python

Stars: ✭ 147 (+297.3%)

Mutual labels: word-segmentation

Syllable segmentation tool for Myanmar language (Burmese) by Ye.

Stars: ✭ 44 (+18.92%)

Mutual labels: word-segmentation

Natural Language Toolkit for bahasa Malaysia, https://malaya.readthedocs.io/

Stars: ✭ 239 (+545.95%)

Mutual labels: pos-tagging

R package for Byte Pair Encoding / Unigram modelling based on Sentencepiece

Stars: ✭ 22 (-40.54%)

Mutual labels: word-segmentation

Kiwi(지능형 한국어 형태소 분석기)

Stars: ✭ 107 (+189.19%)

Mutual labels: word-segmentation

Python version of Sudachi, a Japanese tokenizer.

Stars: ✭ 207 (+459.46%)

Mutual labels: pos-tagging

Thai Natural Language Processing in Python.

Stars: ✭ 582 (+1472.97%)

Mutual labels: word-segmentation

Syntaxnet Parsey McParseface wrapper for POS tagging and dependency parsing

Stars: ✭ 77 (+108.11%)

Mutual labels: pos-tagging

Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).

Stars: ✭ 433 (+1070.27%)

Mutual labels: word-segmentation

Repository for the Georgetown University Multilayer Corpus (GUM)

Stars: ✭ 71 (+91.89%)

Mutual labels: pos-tagging

cross-lingual-struct-flow

PyTorch implementation of ACL paper https://arxiv.org/abs/1906.02656

Stars: ✭ 23 (-37.84%)

Mutual labels: pos-tagging

R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit

Stars: ✭ 160 (+332.43%)

Mutual labels: pos-tagging

Hashformers is a framework for hashtag segmentation with transformers.

Stars: ✭ 18 (-51.35%)

Mutual labels: word-segmentation

A collection of NLP tools for Sinhalese (සිංහල).

Stars: ✭ 38 (+2.7%)

Mutual labels: pos-tagging

Source code for an ACL2016 paper of Chinese word segmentation

Stars: ✭ 81 (+118.92%)

Mutual labels: word-segmentation

Indonesian Nlp Resources

data resource untuk NLP bahasa indonesia

Stars: ✭ 143 (+286.49%)

Mutual labels: pos-tagging

sequence labeling tf

Sequence Labeling in Tensorflow

Stars: ✭ 18 (-51.35%)

Mutual labels: pos-tagging

A toolkit for Vietnamese word segmentation

Stars: ✭ 60 (+62.16%)

Mutual labels: word-segmentation

An unsupervised Chinese word segmentation tool.

Stars: ✭ 13 (-64.86%)

Mutual labels: word-segmentation

comparable-text-miner

Comparable documents miner: Arabic-English morphological analysis, text processing, n-gram features extraction, POS tagging, dictionary translation, documents alignment, corpus information, text classification, tf-idf computation, text similarity computation, html documents cleaning

Stars: ✭ 31 (-16.22%)

Mutual labels: pos-tagging

A lexicon for Sudachi

Stars: ✭ 127 (+243.24%)

Mutual labels: pos-tagging

百度NLP：分词，词性标注，命名实体识别，词重要性

Stars: ✭ 2,792 (+7445.95%)

Mutual labels: word-segmentation

A toolkit for pre-processing large source code corpora

Stars: ✭ 39 (+5.41%)

Mutual labels: word-segmentation

WordSegmentationDP

Word Segmentation with Dynamic Programming

Stars: ✭ 18 (-51.35%)

Mutual labels: word-segmentation

Pytorch ner bilstm cnn crf

End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF implement in pyotrch

Stars: ✭ 249 (+572.97%)

Mutual labels: pos-tagging

SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

Stars: ✭ 1,976 (+5240.54%)

Mutual labels: word-segmentation

English Part-of-Speech Tagger Library; a Ruby port of Lingua::EN::Tagger

Stars: ✭ 217 (+486.49%)

Mutual labels: pos-tagging

Official repository of FISR (AAAI 2020).

Stars: ✭ 72 (+94.59%)

Mutual labels: joint-models

Vietnamese NLP Toolkit for Node

Stars: ✭ 170 (+359.46%)

Mutual labels: pos-tagging

A comparison tool of Japanese tokenizers

Stars: ✭ 95 (+156.76%)

Mutual labels: word-segmentation

Neural network models for joint POS tagging and dependency parsing (CoNLL 2017-2018)

Stars: ✭ 146 (+294.59%)

Mutual labels: pos-tagging

[LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweebank-NER dataset

Stars: ✭ 84 (+127.03%)

Mutual labels: pos-tagging

基于隐式马尔可夫模型和正向最大化匹配的中文分词系统

Stars: ✭ 17 (-54.05%)

Mutual labels: word-segmentation

strollr2d icassp2017

Image Denoising Codes using STROLLR learning, the Matlab implementation of the paper in ICASSP2017

Stars: ✭ 22 (-40.54%)

Mutual labels: joint-models

sentencepiece-jni

Java JNI wrapper for SentencePiece: unsupervised text tokenizer for Neural Network-based text generation.

Stars: ✭ 26 (-29.73%)

Mutual labels: word-segmentation

nlp-cheat-sheet-python

NLP Cheat Sheet, Python, spacy, LexNPL, NLTK, tokenization, stemming, sentence detection, named entity recognition

Stars: ✭ 69 (+86.49%)

Mutual labels: pos-tagging

Spelling correction and string segmentation written in Go

Stars: ✭ 24 (-35.14%)

Mutual labels: word-segmentation

Unsupervised text tokenizer focused on computational efficiency

Stars: ✭ 728 (+1867.57%)

Mutual labels: word-segmentation

1-60 of 81 similar projects