Kagome: Self-contained Japanese morphological analyzer written in pure Go
Stars: ✭ 554 (-12.06%)
extra-model: Code to run the ExtRA algorithm for unsupervised topic/aspect extraction on English texts.
Stars: ✭ 43 (-93.17%)
limelight: A PHP Japanese-language text analyzer and parser.
Stars: ✭ 76 (-87.94%)
Giveme5w1h: Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?
Stars: ✭ 316 (-49.84%)
kanji-web-app: Angular.js kanji web application
Stars: ✭ 45 (-92.86%)
mlconjug3: A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using machine learning techniques.
Stars: ✭ 47 (-92.54%)
wefe: The Word Embeddings Fairness Evaluation Framework (WEFE) standardizes bias measurement and mitigation in word embedding models. Feel free to open an issue if you have any questions, or a pull request if you want to contribute to the project.
Stars: ✭ 164 (-73.97%)
Hibi: [No Active Development] An Android app for learning Japanese by keeping a journal.
Stars: ✭ 37 (-94.13%)
clj-duckling: Language, engine, and tooling for expressing, testing, and evaluating composable language rules on input strings (a duckling Clojure fork).
Stars: ✭ 15 (-97.62%)
minie: An open information extraction system that provides compact extractions
Stars: ✭ 83 (-86.83%)
Lingua: 👄 The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Stars: ✭ 341 (-45.87%)
jrte-corpus: Japanese Realistic Textual Entailment Corpus (NLP 2020, LREC 2020)
Stars: ✭ 66 (-89.52%)
Yomichan: Japanese pop-up dictionary extension for Chrome and Firefox.
Stars: ✭ 464 (-26.35%)
GrammarEngine: Grammatical Dictionary of the Russian Language (+ English, Japanese, etc.)
Stars: ✭ 68 (-89.21%)
classy: classy is a simple-to-use library for building high-performance machine learning models in NLP.
Stars: ✭ 61 (-90.32%)
jaco-js: Japanese character optimizer for JavaScript
Stars: ✭ 72 (-88.57%)
Chatbot ner: chatbot_ner, Named Entity Recognition for chatbots.
Stars: ✭ 273 (-56.67%)
friends-lang: PL for friends in the Japaripark (a logic programming language whose syntax is a joke referencing a Japanese animation)
Stars: ✭ 66 (-89.52%)
bllip-parser: BLLIP reranking parser (also known as the Charniak-Johnson parser, Charniak parser, or Brown reranking parser). See http://pypi.python.org/pypi/bllipparser/ for the Python module.
Stars: ✭ 217 (-65.56%)
ppdb: Interface for reading the Paraphrase Database (PPDB)
Stars: ✭ 22 (-96.51%)
NLP Toolkit: Library of state-of-the-art models (PyTorch) for NLP tasks
Stars: ✭ 92 (-85.4%)
NLP-tools: Useful Python NLP tools (evaluation, GUI, tokenization)
Stars: ✭ 39 (-93.81%)
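Nuts: An experimental testbed of solutions for common natural language processing tasks (mainly text classification, sequence labeling, automatic question answering, etc.)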
Stars: ✭ 21 (-96.67%)
Pynlpl: PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language models. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL), as well as clients to interface with various NLP-specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
Stars: ✭ 426 (-32.38%)
simple NER: Simple rule-based named entity recognition
Stars: ✭ 29 (-95.4%)
unofficial-jisho-api: Encapsulates the official Jisho.org API and also provides kanji, example, and stroke diagram search.
Stars: ✭ 88 (-86.03%)
Sudachi: A Japanese Tokenizer for Business
Stars: ✭ 496 (-21.27%)
ebe-dataset: Evidence-based Explanation Dataset (AACL-IJCNLP 2020)
Stars: ✭ 16 (-97.46%)
OpenPrompt: An Open-Source Framework for Prompt-Learning.
Stars: ✭ 1,769 (+180.79%)
Contextualized Topic Models: A Python package to run contextualized topic modeling. CTMs combine BERT with topic models to get coherent topics. Also supports multilingual tasks. Cross-lingual zero-shot model published at EACL 2021.
Stars: ✭ 318 (-49.52%)
TextFeatureSelection: Python library for feature selection on text features. It provides a filter method, a genetic algorithm, and TextFeatureSelectionEnsemble for improving text classification models.
Stars: ✭ 42 (-93.33%)
madomagiOOP: 👨💻♐ OOP learning with anime magical girls (learning object orientation with magical girls). 🧙
Stars: ✭ 17 (-97.3%)
kanji-frequency: Kanji usage frequency data collected from various sources
Stars: ✭ 92 (-85.4%)
kotoba: A Discord bot for helping with learning Japanese.
Stars: ✭ 118 (-81.27%)
google-news-scraper: Google News Scraper for languages like Japanese, Chinese... [VPN Support]
Stars: ✭ 88 (-86.03%)
Quick Nlp: PyTorch NLP library based on FastAI
Stars: ✭ 279 (-55.71%)
Nihonoari-App: A small, minimalist Japanese kana training app
Stars: ✭ 66 (-89.52%)
Giveme5W: Extraction of the five journalistic W-questions (5W) from news articles
Stars: ✭ 16 (-97.46%)
empythy: Automated NLP sentiment predictions, batteries included, or use your own data
Stars: ✭ 17 (-97.3%)
Spacy: 💫 Industrial-strength Natural Language Processing (NLP) in Python
Stars: ✭ 21,978 (+3388.57%)
DidacticalEnigma: An integrated translator environment for translating text from Japanese to English
Stars: ✭ 29 (-95.4%)
rsmorphy: Morphological analyzer / inflection engine for the Russian and Ukrainian languages, rewritten in Rust
Stars: ✭ 27 (-95.71%)
Nagisa: A Japanese tokenizer based on recurrent neural networks
Stars: ✭ 260 (-58.73%)
Pythainlp: Thai Natural Language Processing in Python.
Stars: ✭ 582 (-7.62%)
type-kana: A quiz app to help you learn hiragana and katakana, the Japanese syllabaries
Stars: ✭ 21 (-96.67%)
lima: The Libre Multilingual Analyzer, a Natural Language Processing (NLP) C++ toolkit.
Stars: ✭ 75 (-88.1%)
Ekphrasis: Ekphrasis is a text processing tool geared towards text from social networks, such as Twitter or Facebook. It performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (English Wikipedia and Twitter, 330 million English tweets).
Stars: ✭ 433 (-31.27%)
KanaQuiz: A simple app to quiz the user on identifying Japanese characters.
Stars: ✭ 19 (-96.98%)
fontregisterer: Cross-platform automatic font registration package for drawing graphs in R
Stars: ✭ 14 (-97.78%)