JumanppJuman++ (a Morphological Analyzer Toolkit)
Stars: ✭ 254 (+1593.33%)
NagisaA Japanese tokenizer based on recurrent neural networks
Stars: ✭ 260 (+1633.33%)
KagomeSelf-contained Japanese Morphological Analyzer written in pure Go
Stars: ✭ 554 (+3593.33%)
Pytorch Pos TaggingA tutorial on how to implement models for part-of-speech tagging using PyTorch and TorchText.
Stars: ✭ 96 (+540%)
Pytorch-NLUPytorch-NLU,一个中文文本分类、序列标注工具包,支持中文长文本、短文本的多类、多标签分类任务,支持中文命名实体识别、词性标注、分词等序列标注任务。 Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of speech ta…
Stars: ✭ 151 (+906.67%)
MonpaMONPA 罔拍是一個提供正體中文斷詞、詞性標註以及命名實體辨識的多任務模型
Stars: ✭ 203 (+1253.33%)
ArticutapiAPI of Articut 中文斷詞 (兼具語意詞性標記):「斷詞」又稱「分詞」,是中文資訊處理的基礎。Articut 不用機器學習,不需資料模型,只用現代白話中文語法規則,即能達到 SIGHAN 2005 F1-measure 94% 以上,Recall 96% 以上的成績。
Stars: ✭ 252 (+1580%)
QutufQutuf (قُطُوْف): An Arabic Morphological analyzer and Part-Of-Speech tagger as an Expert System.
Stars: ✭ 84 (+460%)
VncorenlpA Vietnamese natural language processing toolkit (NAACL 2018)
Stars: ✭ 354 (+2260%)
RdrpostaggerA fast and accurate POS and morphological tagging toolkit (EACL 2014)
Stars: ✭ 126 (+740%)
Lac百度NLP:分词,词性标注,命名实体识别,词重要性
Stars: ✭ 2,792 (+18513.33%)
CwsSource code for an ACL2016 paper of Chinese word segmentation
Stars: ✭ 81 (+440%)
JptdpNeural network models for joint POS tagging and dependency parsing (CoNLL 2017-2018)
Stars: ✭ 146 (+873.33%)
SynThaiThai Word Segmentation and Part-of-Speech Tagging with Deep Learning
Stars: ✭ 41 (+173.33%)
datalinguistStanford CoreNLP in idiomatic Clojure.
Stars: ✭ 93 (+520%)
cn-holidaya lib for chinese holiday
Stars: ✭ 22 (+46.67%)
wasm-cn[翻译中] WebAssembly 中文文档
Stars: ✭ 22 (+46.67%)
seqgan用seqgan训练生成小黄鸡语料
Stars: ✭ 33 (+120%)
customized-symspellJava port of SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm
Stars: ✭ 51 (+240%)
type-kanaA quiz app to help you learn hiragana and katakana, the Japanese syllabaries
Stars: ✭ 21 (+40%)
DidacticalEnigmaAn integrated translator environment for translating text from Japanese to English
Stars: ✭ 29 (+93.33%)
udarUDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms.
Stars: ✭ 15 (+0%)
GameWord记录一下游戏常用单词的中英文对照
Stars: ✭ 157 (+946.67%)
SymSpellCppPyFast SymSpell written in c++ and exposes to python via pybind11
Stars: ✭ 28 (+86.67%)
ATKSpythis repository is a python package that supports SOAP interface to communicate with the Microsoft ATKS
Stars: ✭ 27 (+80%)
friends-langPL for friends in the Japaripark (Logical programming language with Japanese animation-reference joke syntax)
Stars: ✭ 66 (+340%)
WMPoetryThe source codes of Working Memory model for Chinese poetry generation (IJCAI 2018).
Stars: ✭ 49 (+226.67%)
hanzi-toolsConverts from Chinese characters to pinyin, between simplified and traditional, and does word segmentation.
Stars: ✭ 69 (+360%)
rippletaggerRippleTagger identifies part-of-speech tags (Nouns, Verbs, and so on...). You give it a sentence, it gives you a list of tags back.
Stars: ✭ 12 (-20%)
fishing-funds基金,大盘,股票,虚拟货币状态栏显示小应用,基于Electron开发,支持MacOS,Windows,Linux客户端,数据源来自天天基金,蚂蚁基金,爱基金,腾讯证券,新浪基金等
Stars: ✭ 424 (+2726.67%)
fontregistererCross-platform auto font registerer for R grapics/Rでグラフ描くためのフォント自動登録パッケージ (クロスプラットフォーム)
Stars: ✭ 14 (-6.67%)
kulaLightweight and highly extensible .NET scripting language.
Stars: ✭ 43 (+186.67%)
ebe-datasetEvidence-based Explanation Dataset (AACL-IJCNLP 2020)
Stars: ✭ 16 (+6.67%)
OpenGNTOpen Greek New Testament Project; NA28 / NA27 Equivalent Text & Resources
Stars: ✭ 55 (+266.67%)
kanji-web-appAngular.js kanji web application
Stars: ✭ 45 (+200%)
Hibi[No Active Development] An Android app for learning Japanese by keeping a journal.
Stars: ✭ 37 (+146.67%)
ChineseFontsConvert asian text to web fonts
Stars: ✭ 14 (-6.67%)
syngA free, open source, cross-platform, Chinese-To-English dictionary for desktops.
Stars: ✭ 108 (+620%)
ODSQAODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET
Stars: ✭ 43 (+186.67%)
youtokentome-rubyHigh performance unsupervised text tokenization for Ruby
Stars: ✭ 17 (+13.33%)
VARBook程序员的英语助手,输入中文,智能转换为英文变量
Stars: ✭ 24 (+60%)
LightLM高性能小模型测评 Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task
Stars: ✭ 54 (+260%)
ccalendarChinese Calendar in calendar(1) for BSD, Linux & macOS
Stars: ✭ 17 (+13.33%)
CBLUE中文医疗信息处理基准CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark
Stars: ✭ 379 (+2426.67%)
madomagiOOP👨💻♐ OOP learning with anime magical girl. (魔法少女で学ぶオブジェクト指向)🧙
Stars: ✭ 17 (+13.33%)
UETsegmenterA toolkit for Vietnamese word segmentation
Stars: ✭ 60 (+300%)
kaldi-timit-sre-ivectorDevelop speaker recognition model based on i-vector using TIMIT database
Stars: ✭ 17 (+13.33%)