312 open source projects that are alternatives to or similar to Nagisa

A comparison tool for Japanese tokenizers

Stars: ✭ 95 (-63.46%)

Mutual labels: japanese, nlp-library, word-segmentation

Pytorch-NLU, a Chinese text classification and sequence labeling toolkit. It supports multi-class and multi-label classification of long and short Chinese texts, as well as sequence labeling tasks such as Chinese named entity recognition, part-of-speech tagging, and word segmentation.

Stars: ✭ 151 (-41.92%)

Mutual labels: word-segmentation, pos-tagging, sequence-labeling

Kagome

Self-contained Japanese Morphological Analyzer written in pure Go

Stars: ✭ 554 (+113.08%)

Mutual labels: japanese, nlp-library, pos-tagging

Jumanpp

Juman++ (a Morphological Analyzer Toolkit)

Stars: ✭ 254 (-2.31%)

Mutual labels: japanese, pos-tagging, word-segmentation

Multi Task Nlp

multi_task_NLP is a utility toolkit enabling NLP developers to easily train and infer a single model for multiple tasks.

Stars: ✭ 221 (-15%)

Mutual labels: nlp-library, sequence-labeling

Vncorenlp

A Vietnamese natural language processing toolkit (NAACL 2018)

Stars: ✭ 354 (+36.15%)

Mutual labels: pos-tagging, word-segmentation

Kuromoji

Kuromoji is a self-contained and very easy to use Japanese morphological analyzer designed for search

Stars: ✭ 745 (+186.54%)

Mutual labels: japanese, nlp-library

rakutenma-python

Rakuten MA (Python version)

Stars: ✭ 15 (-94.23%)

Mutual labels: word-segmentation, pos-tagging

Pythainlp

Thai Natural Language Processing in Python.

Stars: ✭ 582 (+123.85%)

Mutual labels: nlp-library, word-segmentation

A Pytorch Tutorial To Sequence Labeling

Empower Sequence Labeling with Task-Aware Neural Language Model | a PyTorch Tutorial to Sequence Labeling

Stars: ✭ 257 (-1.15%)

Mutual labels: sequence-labeling, pos-tagging

Nuts

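An experimental testbed of solutions for common natural language processing tasks (mainly text classification, sequence labeling, and automatic question answering)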

Stars: ✭ 21 (-91.92%)

Mutual labels: nlp-library, sequence-labeling

Sudachipy

Python version of Sudachi, a Japanese tokenizer.

Stars: ✭ 207 (-20.38%)

Mutual labels: nlp-library, pos-tagging

Pytorch ner bilstm cnn crf

End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF, implemented in PyTorch

Stars: ✭ 249 (-4.23%)

Mutual labels: sequence-labeling, pos-tagging

Monpa

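MONPA (罔拍) is a multi-task model that provides Traditional Chinese word segmentation, part-of-speech tagging, and named entity recognition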

Stars: ✭ 203 (-21.92%)

Mutual labels: pos-tagging, word-segmentation

Ekphrasis

Ekphrasis is a text processing tool geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags), and spell correction, using word statistics from two large corpora (English Wikipedia and Twitter, 330 million English tweets).

Stars: ✭ 433 (+66.54%)

Mutual labels: nlp-library, word-segmentation

fairseq-tagging

A Fairseq fork for sequence tagging/labeling tasks

Stars: ✭ 26 (-90%)

Mutual labels: pos-tagging, sequence-labeling

SynThai

Thai Word Segmentation and Part-of-Speech Tagging with Deep Learning

Stars: ✭ 41 (-84.23%)

Mutual labels: word-segmentation, pos-tagging

Sudachi

A Japanese Tokenizer for Business

Stars: ✭ 496 (+90.77%)

Mutual labels: nlp-library, pos-tagging

sequence labeling tf

Sequence Labeling in Tensorflow

Stars: ✭ 18 (-93.08%)

Mutual labels: pos-tagging, sequence-labeling

pytorch Joint-Word-Segmentation-and-POS-Tagging

Paper: A Simple and Effective Neural Model for Joint Word Segmentation and POS Tagging

Stars: ✭ 37 (-85.77%)

Mutual labels: word-segmentation, pos-tagging

SymSpellCppPy

Fast SymSpell written in C++ and exposed to Python via pybind11

Stars: ✭ 28 (-89.23%)

Mutual labels: word-segmentation

sembei

🍘 Word embeddings that do not rely on word segmentation 🍘

Stars: ✭ 14 (-94.62%)

Mutual labels: japanese

Japanese-Words

A compilation of Japanese N2 vocabulary (from New Standard Japanese, elementary and intermediate levels)

Stars: ✭ 41 (-84.23%)

Mutual labels: japanese

gazou

Japanese OCR for Linux & Windows

Stars: ✭ 32 (-87.69%)

Mutual labels: japanese

CrowdLayer

A neural network layer that enables training of deep neural networks directly from crowdsourced labels (e.g. from Amazon Mechanical Turk) or, more generally, labels from multiple annotators with different biases and levels of expertise.

Stars: ✭ 45 (-82.69%)

Mutual labels: sequence-labeling

kanji-web-app

Angular.js kanji web application

Stars: ✭ 45 (-82.69%)

Mutual labels: japanese

analyze-desumasu-dearu

A JavaScript library that analyzes whether sentences use the polite style (desu/masu form) or the plain style (de aru form)

Stars: ✭ 15 (-94.23%)

Mutual labels: japanese

Giveme5W

Extraction of the five journalistic W-questions (5W) from news articles

Stars: ✭ 16 (-93.85%)

Mutual labels: nlp-library

unofficial-jisho-api

Encapsulates the official Jisho.org API and also provides kanji, example, and stroke diagram search.

Stars: ✭ 88 (-66.15%)

Mutual labels: japanese

customized-symspell

Java port of SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm

Stars: ✭ 51 (-80.38%)

Mutual labels: word-segmentation

scoop-for-jp

Scoop bucket for ALL Japanese users.

Stars: ✭ 17 (-93.46%)

Mutual labels: japanese

wana kana rust

Utility library for checking and converting between Japanese characters - Hiragana, Katakana - and Romaji

Stars: ✭ 46 (-82.31%)

Mutual labels: japanese

Articutapi

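API of Articut, a Chinese word segmenter (with semantic part-of-speech tagging). Word segmentation (斷詞, also called 分詞) is the foundation of Chinese information processing. Articut uses no machine learning and needs no data model; relying only on the grammar rules of modern vernacular Chinese, it achieves an F1-measure above 94% and recall above 96% on SIGHAN 2005.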

Stars: ✭ 252 (-3.08%)

Mutual labels: pos-tagging

unsupervised-pos-tagging

Unsupervised part-of-speech tag estimation

Stars: ✭ 16 (-93.85%)

Mutual labels: pos-tagging

visual syntactic embedding video captioning

Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*

Stars: ✭ 23 (-91.15%)

Mutual labels: pos-tagging

unidic-py

Unidic packaged for installation via pip.

Stars: ✭ 17 (-93.46%)

Mutual labels: japanese

knp

A Japanese Parser

Stars: ✭ 16 (-93.85%)

Mutual labels: japanese

KWDLC

Kyoto University Web Document Leads Corpus

Stars: ✭ 64 (-75.38%)

Mutual labels: japanese

japanese-pitch-accent-resources

An attempt to consolidate Japanese phonetic resources, pitch accent resources in particular, into one list

Stars: ✭ 64 (-75.38%)

Mutual labels: japanese

jp-ocr-prunned-cnn

Attempting feature map pruning on a CNN trained for Japanese OCR

Stars: ✭ 15 (-94.23%)

Mutual labels: japanese

textlint-ja

The Japanese textlint community / ideas for rules

Stars: ✭ 41 (-84.23%)

Mutual labels: japanese

ATKSpy

This repository is a Python package that supports a SOAP interface for communicating with Microsoft ATKS

Stars: ✭ 27 (-89.62%)

Mutual labels: pos-tagging

hanzi-tools

Converts from Chinese characters to pinyin, between simplified and traditional, and does word segmentation.

Stars: ✭ 69 (-73.46%)

Mutual labels: word-segmentation

kanji

Haskell suite for determining what 級 (level) of the 漢字検定 (national Kanji exam) a given Kanji belongs to.

Stars: ✭ 19 (-92.69%)

Mutual labels: japanese

TALPCo

TUFS Asian Language Parallel Corpus

Stars: ✭ 32 (-87.69%)

Mutual labels: japanese

PIE

Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models for Local Sequence Transduction": www.aclweb.org/anthology/D19-1435.pdf (EMNLP-IJCNLP 2019)

Stars: ✭ 164 (-36.92%)

Mutual labels: sequence-labeling

KanjiRecognitionDictionary

Perfect for those who forget kanji pronunciation

Stars: ✭ 14 (-94.62%)

Mutual labels: japanese

AlpacaTag

AlpacaTag: An Active Learning-based Crowd Annotation Framework for Sequence Tagging (ACL 2019 Demo)

Stars: ✭ 126 (-51.54%)

Mutual labels: sequence-labeling

NLP Toolkit

Library of state-of-the-art models (PyTorch) for NLP tasks