YomichanJapanese pop-up dictionary extension for Chrome and Firefox.
Stars: ✭ 464 (+271.2%)
ToiroA comparison tool of Japanese tokenizers
Stars: ✭ 95 (-24%)
Open Korean TextOpen Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (+250.4%)
String CalcPHP calculator library for mathematical terms (expressions) passed as strings
Stars: ✭ 60 (-52%)
EkphrasisEkphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
Stars: ✭ 433 (+246.4%)
Php Parser🌿 NodeJS PHP Parser - extract AST or tokens (PHP5 and PHP7)
Stars: ✭ 400 (+220%)
JflexThe fast scanner generator for Java™ with full Unicode support
Stars: ✭ 380 (+204%)
JconvPure-JavaScript converter for Japanese character encodings.
Stars: ✭ 91 (-27.2%)
LexmachineLex machinary for go.
Stars: ✭ 335 (+168%)
Vanilla AutokanaA Vanilla-JavaScript library to complete furigana automatically.
Stars: ✭ 48 (-61.6%)
SentencesA multilingual command line sentence tokenizer in Golang
Stars: ✭ 293 (+134.4%)
GseGo efficient multilingual NLP and text segmentation; support english, chinese, japanese and other. Go 高性能多语言 NLP 和分词
Stars: ✭ 1,695 (+1256%)
YakuhanjpYakumono-Hankaku Only Web Fonts
Stars: ✭ 288 (+130.4%)
Py NltoolsA collection of basic python modules for spoken natural language processing
Stars: ✭ 46 (-63.2%)
pascal-interpreterA simple interpreter for a large subset of Pascal language written for educational purposes
Stars: ✭ 21 (-83.2%)
Owasp MasvsThe Mobile Application Security Verification Standard (MASVS) is a standard for mobile app security.
Stars: ✭ 1,030 (+724%)
TopokanjiTopologically ordered lists of kanji for effective learning
Stars: ✭ 108 (-13.6%)
ArabicProcessingCogA Python package that do stemming, tokenization, sentence breaking, segmentation, normalization, POS tagging for Arabic language.
Stars: ✭ 19 (-84.8%)
Adobe Japan1The Adobe-Japan1-7 Character Collection
Stars: ✭ 38 (-69.6%)
ZipanguA library for compatibility about Japan.
Stars: ✭ 27 (-78.4%)
DjurlSimple yet helpful library for writing Django urls by an easy, short and intuitive way.
Stars: ✭ 85 (-32%)
sembei🍘 単語分割を経由しない単語埋め込み 🍘
Stars: ✭ 14 (-88.8%)
Nlp Js Tools FrenchPOS Tagger, lemmatizer and stemmer for french language in javascript
Stars: ✭ 32 (-74.4%)
unidic-pyUnidic packaged for installation via pip.
Stars: ✭ 17 (-86.4%)
ChevrotainParser Building Toolkit for JavaScript
Stars: ✭ 1,795 (+1336%)
Omnicat BayesNaive Bayes text classification implementation as an OmniCat classifier strategy. (#ruby #naivebayes)
Stars: ✭ 30 (-76%)
sample-ui-reactMaterial-UI+ React.js + Redux [ Pug / Scss / Babel ]
Stars: ✭ 15 (-88%)
Sentence SplitterText to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.
Stars: ✭ 82 (-34.4%)
PaddleTokenizer使用 PaddlePaddle 实现基于深度神经网络的中文分词引擎 | A DNN Chinese Tokenizer by Using PaddlePaddle
Stars: ✭ 14 (-88.8%)
LfuzzerFuzzing Parsers with Tokens
Stars: ✭ 28 (-77.6%)
Languagepod101 ScraperPython scraper for Language Pods such as Japanesepod101.com 👹 🗾 🍣 Compatible with Japanese, Chinese, French, German, Italian, Korean, Portuguese, Russian, Spanish and many more! ✨
Stars: ✭ 104 (-16.8%)
wana kana rustUtility library for checking and converting between Japanese characters - Hiragana, Katakana - and Romaji
Stars: ✭ 46 (-63.2%)
Momdo.github.ioJapanese translation of the W3C/WHATWG specification(s).
Stars: ✭ 81 (-35.2%)
bredonA modern CSS value compiler in JavaScript
Stars: ✭ 39 (-68.8%)
KWDLCKyoto University Web Document Leads Corpus
Stars: ✭ 64 (-48.8%)
mystem-scalaMorphological analyzer `mystem` (Russian language) wrapper for JVM languages
Stars: ✭ 21 (-83.2%)
Snl CompilerSNL(Small Nested Language) Compiler. Maven jUnit Tokenizer Lexer Syntax Parser. 编译原理 词法分析 语法分析
Stars: ✭ 19 (-84.8%)
kanjiHaskell suite for determining what 級 (level) of the 漢字検定 (national Kanji exam) a given Kanji belongs to.
Stars: ✭ 19 (-84.8%)
Risingstars2016A complete overview of the JavaScript landscape in 2016: trends about front-end and node.js frameworks, tooling... Available in English, Japanese and Chinese.
Stars: ✭ 75 (-40%)
Hibi[No Active Development] An Android app for learning Japanese by keeping a journal.
Stars: ✭ 37 (-70.4%)
NatashaSolves basic Russian NLP tasks, API for lower level Natasha projects
Stars: ✭ 788 (+530.4%)
ilmultiTooling to play around with multilingual machine translation for Indian Languages.
Stars: ✭ 19 (-84.8%)
Mustard🌭 Mustard is a Swift library for tokenizing strings when splitting by whitespace doesn't cut it.
Stars: ✭ 689 (+451.2%)
CutletJapanese to romaji converter in Python
Stars: ✭ 124 (-0.8%)
SyntokText tokenization and sentence segmentation (segtok v2)
Stars: ✭ 123 (-1.6%)
TokenizerSource code tokenizer
Stars: ✭ 119 (-4.8%)
Nodejs JaNode.js 日本語ローカリゼーション
Stars: ✭ 98 (-21.6%)
Soynlp한국어 자연어처리를 위한 파이썬 라이브러리입니다. 단어 추출/ 토크나이저 / 품사판별/ 전처리의 기능을 제공합니다.
Stars: ✭ 613 (+390.4%)