All Projects → UETsegmenter → Similar Projects or Alternatives

54 Open source projects that are alternatives of or similar to UETsegmenter

word tokenize
Vietnamese Word Tokenize
Stars: ✭ 45 (-25%)
Mutual labels:  vietnamese, word-segmentation
Vietnamese-Accent-Prediction
A simple/fast/accurate accent prediction for non-accented Vietnamese text
Stars: ✭ 31 (-48.33%)
Mutual labels:  vietnamese
SymSpellCppPy
Fast SymSpell written in c++ and exposes to python via pybind11
Stars: ✭ 28 (-53.33%)
Mutual labels:  word-segmentation
customized-symspell
Java port of SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm
Stars: ✭ 51 (-15%)
Mutual labels:  word-segmentation
tudien
Từ điển tiếng Việt dành cho Kindle
Stars: ✭ 38 (-36.67%)
Mutual labels:  vietnamese
number-to-words
⚡ Thư viện hổ trợ chuyển đổi số sang chữ số Tiếng Việt.
Stars: ✭ 19 (-68.33%)
Mutual labels:  vietnamese
classification
Vietnamese Text Classification
Stars: ✭ 39 (-35%)
Mutual labels:  vietnamese
hanzi-tools
Converts from Chinese characters to pinyin, between simplified and traditional, and does word segmentation.
Stars: ✭ 69 (+15%)
Mutual labels:  word-segmentation
Pytorch-NLU
Pytorch-NLU,一个中文文本分类、序列标注工具包,支持中文长文本、短文本的多类、多标签分类任务,支持中文命名实体识别、词性标注、分词等序列标注任务。 Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of speech ta…
Stars: ✭ 151 (+151.67%)
Mutual labels:  word-segmentation
community
Ông Dev Community
Stars: ✭ 64 (+6.67%)
Mutual labels:  vietnamese
google assistant vietnamese speaking
Đây là dự án độ lại loa thông minh chạy Google Assistant hỗ trợ đa ngôn ngữ trong đó có tiếng Việt, phần source code do Nguyễn Duy code lại từ Source Gốc của Google
Stars: ✭ 19 (-68.33%)
Mutual labels:  vietnamese
dnn-lstm-word-segment
Chinese Word Segmention Base on the Deep Learning and LSTM Neural Network
Stars: ✭ 24 (-60%)
Mutual labels:  word-segmentation
codeprep
A toolkit for pre-processing large source code corpora
Stars: ✭ 39 (-35%)
Mutual labels:  word-segmentation
SpeakIt Vietnamese TTS
Vietnamese Text-to-Speech on Windows Project (zalo-speech)
Stars: ✭ 81 (+35%)
Mutual labels:  vietnamese
sylbreak
Syllable segmentation tool for Myanmar language (Burmese) by Ye.
Stars: ✭ 44 (-26.67%)
Mutual labels:  word-segmentation
pytorch Joint-Word-Segmentation-and-POS-Tagging
Paper: A Simple and Effective Neural Model for Joint Word Segmentation and POS Tagging
Stars: ✭ 37 (-38.33%)
Mutual labels:  word-segmentation
sentencepiece-jni
Java JNI wrapper for SentencePiece: unsupervised text tokenizer for Neural Network-based text generation.
Stars: ✭ 26 (-56.67%)
Mutual labels:  word-segmentation
JointIDSF
BERT-based joint intent detection and slot filling with intent-slot attention mechanism (INTERSPEECH 2021)
Stars: ✭ 55 (-8.33%)
Mutual labels:  vietnamese
vietTTS
Vietnamese Text to Speech library
Stars: ✭ 78 (+30%)
Mutual labels:  vietnamese
skt
Sanskrit compound segmentation using seq2seq model
Stars: ✭ 21 (-65%)
Mutual labels:  word-segmentation
ckipnlp
CKIP CoreNLP Toolkits
Stars: ✭ 92 (+53.33%)
Mutual labels:  word-segmentation
WordSegmentationDP
Word Segmentation with Dynamic Programming
Stars: ✭ 18 (-70%)
Mutual labels:  word-segmentation
sentencepiece
R package for Byte Pair Encoding / Unigram modelling based on Sentencepiece
Stars: ✭ 22 (-63.33%)
Mutual labels:  word-segmentation
spell
Spelling correction and string segmentation written in Go
Stars: ✭ 24 (-60%)
Mutual labels:  word-segmentation
vietnamese-roberta
A Robustly Optimized BERT Pretraining Approach for Vietnamese
Stars: ✭ 22 (-63.33%)
Mutual labels:  vietnamese
SynThai
Thai Word Segmentation and Part-of-Speech Tagging with Deep Learning
Stars: ✭ 41 (-31.67%)
Mutual labels:  word-segmentation
vietnamese-password-dicts
Tổng hợp danh sách mật khẩu wifi tiếng Việt sử dụng cho aircrack-ng
Stars: ✭ 40 (-33.33%)
Mutual labels:  vietnamese
esapp
An unsupervised Chinese word segmentation tool.
Stars: ✭ 13 (-78.33%)
Mutual labels:  word-segmentation
automatic speech recognition
Vietnamese Automatic Speech Recognition
Stars: ✭ 58 (-3.33%)
Mutual labels:  vietnamese
TALPCo
TUFS Asian Language Parallel Corpus
Stars: ✭ 32 (-46.67%)
Mutual labels:  vietnamese
vietnamese word seperate
Seperate vietnamese using lstm
Stars: ✭ 13 (-78.33%)
Mutual labels:  vietnamese
Userscript
Userscripts collection written by me
Stars: ✭ 92 (+53.33%)
Mutual labels:  vietnamese
lstm-crf-tagging
No description or website provided.
Stars: ✭ 13 (-78.33%)
Mutual labels:  vietnamese
Monpa
MONPA 罔拍是一個提供正體中文斷詞、詞性標註以及命名實體辨識的多任務模型
Stars: ✭ 203 (+238.33%)
Mutual labels:  word-segmentation
Lac
百度NLP:分词,词性标注,命名实体识别,词重要性
Stars: ✭ 2,792 (+4553.33%)
Mutual labels:  word-segmentation
Pycantonese
Cantonese Linguistics and NLP in Python
Stars: ✭ 147 (+145%)
Mutual labels:  word-segmentation
Symspell
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Stars: ✭ 1,976 (+3193.33%)
Mutual labels:  word-segmentation
Kiwi
Kiwi(지능형 한국어 형태소 분석기)
Stars: ✭ 107 (+78.33%)
Mutual labels:  word-segmentation
Toiro
A comparison tool of Japanese tokenizers
Stars: ✭ 95 (+58.33%)
Mutual labels:  word-segmentation
Cws
Source code for an ACL2016 paper of Chinese word segmentation
Stars: ✭ 81 (+35%)
Mutual labels:  word-segmentation
Han Segment
基于隐式马尔可夫模型和正向最大化匹配的中文分词系统
Stars: ✭ 17 (-71.67%)
Mutual labels:  word-segmentation
Youtokentome
Unsupervised text tokenizer focused on computational efficiency
Stars: ✭ 728 (+1113.33%)
Mutual labels:  word-segmentation
Pythainlp
Thai Natural Language Processing in Python.
Stars: ✭ 582 (+870%)
Mutual labels:  word-segmentation
Sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
Stars: ✭ 5,540 (+9133.33%)
Mutual labels:  word-segmentation
Ekphrasis
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
Stars: ✭ 433 (+621.67%)
Mutual labels:  word-segmentation
Symspellpy
Python port of SymSpell
Stars: ✭ 420 (+600%)
Mutual labels:  word-segmentation
Bert Multitask Learning
BERT for Multitask Learning
Stars: ✭ 380 (+533.33%)
Mutual labels:  word-segmentation
Vncorenlp
A Vietnamese natural language processing toolkit (NAACL 2018)
Stars: ✭ 354 (+490%)
Mutual labels:  word-segmentation
Nagisa
A Japanese tokenizer based on recurrent neural networks
Stars: ✭ 260 (+333.33%)
Mutual labels:  word-segmentation
Jumanpp
Juman++ (a Morphological Analyzer Toolkit)
Stars: ✭ 254 (+323.33%)
Mutual labels:  word-segmentation
hashformers
Hashformers is a framework for hashtag segmentation with transformers.
Stars: ✭ 18 (-70%)
Mutual labels:  word-segmentation
cws-tensorflow
基于Tensorflow的中文分词模型
Stars: ✭ 25 (-58.33%)
Mutual labels:  word-segmentation
rakutenma-python
Rakuten MA (Python version)
Stars: ✭ 15 (-75%)
Mutual labels:  word-segmentation
youtokentome-ruby
High performance unsupervised text tokenization for Ruby
Stars: ✭ 17 (-71.67%)
Mutual labels:  word-segmentation
1-54 of 54 similar projects