OsetiDictionary based Sentiment Analysis for Japanese
Stars: ✭ 49 (-68.79%)
The Tab Of WordsA minimal Chrome / Firefox extension to help you learn Japanese words in each new tab.
Stars: ✭ 94 (-40.13%)
Go Search Replace🚀 Search & replace URLs in WordPress SQL files.
Stars: ✭ 57 (-63.69%)
PynlplPyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
Stars: ✭ 426 (+171.34%)
TopokanjiTopologically ordered lists of kanji for effective learning
Stars: ✭ 108 (-31.21%)
PadatiousA neural network intent parser
Stars: ✭ 124 (-21.02%)
Python NameparserA simple Python module for parsing human names into their individual components
Stars: ✭ 462 (+194.27%)
TextpipeTextpipe: clean and extract metadata from text
Stars: ✭ 284 (+80.89%)
Textcluster短文本聚类预处理模块 Short text cluster
Stars: ✭ 115 (-26.75%)
Lingua FrancaMycroft's multilingual text parsing and formatting library
Stars: ✭ 51 (-67.52%)
Konoha🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.
Stars: ✭ 130 (-17.2%)
FxtA large scale feature extraction tool for text-based machine learning
Stars: ✭ 25 (-84.08%)
Languagepod101 ScraperPython scraper for Language Pods such as Japanesepod101.com 👹 🗾 🍣 Compatible with Japanese, Chinese, French, German, Italian, Korean, Portuguese, Russian, Spanish and many more! ✨
Stars: ✭ 104 (-33.76%)
GohnHatena Notation (はてな記法) Parser written in Go
Stars: ✭ 17 (-89.17%)
Stanza OldStanford NLP group's shared Python tools.
Stars: ✭ 142 (-9.55%)
WhatlanggoNatural language detection library for Go
Stars: ✭ 479 (+205.1%)
MtpMulti-lingual Text Processing
Stars: ✭ 87 (-44.59%)
Open Korean TextOpen Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (+178.98%)
IchiranLinguistic tools for texts in Japanese language
Stars: ✭ 120 (-23.57%)
BsedSimple SQL-like syntax on top of Perl text processing.
Stars: ✭ 414 (+163.69%)
KefirbbA flexible Java text processor. BB, BBCode, BB-code, HTML, Textile, Markdown, parser, translator, converter.
Stars: ✭ 83 (-47.13%)
TerText Expression Runner – Readable and easy to use text expressions
Stars: ✭ 67 (-57.32%)
ArabicProcessingCogA Python package that do stemming, tokenization, sentence breaking, segmentation, normalization, POS tagging for Arabic language.
Stars: ✭ 19 (-87.9%)
Cogcomp NlpyCogComp's light-weight Python NLP annotators
Stars: ✭ 115 (-26.75%)
PrenlpPreprocessing Library for Natural Language Processing
Stars: ✭ 130 (-17.2%)
PipeitPipeIt is a text transformation, conversion, cleansing and extraction tool.
Stars: ✭ 57 (-63.69%)
Colibri CoreColibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.
Stars: ✭ 112 (-28.66%)
PyparsingPython library for creating PEG parsers
Stars: ✭ 1,052 (+570.06%)
BrowsecloudA web app to create and browse text visualizations for automated customer listening.
Stars: ✭ 143 (-8.92%)
Qp Trie RsAn idiomatic and fast QP-trie implementation in pure Rust.
Stars: ✭ 47 (-70.06%)
Command Line Text Processing⚡ From finding text to search and replace, from sorting to beautifying text and more 🎨
Stars: ✭ 9,771 (+6123.57%)
Concise Ipython Notebooks For Deep LearningIpython Notebooks for solving problems like classification, segmentation, generation using latest Deep learning algorithms on different publicly available text and image data-sets.
Stars: ✭ 23 (-85.35%)
LibasciidocA Golang library for processing Asciidoc files.
Stars: ✭ 129 (-17.83%)
Chr🔤 Lightweight R package for manipulating [string] characters
Stars: ✭ 18 (-88.54%)
BplBinary Processing Language
Stars: ✭ 103 (-34.39%)
JanomeJapanese morphological analysis engine written in pure Python
Stars: ✭ 630 (+301.27%)
NegapojiJapanese negative positive classification.日本語文書のネガポジを判定。
Stars: ✭ 148 (-5.73%)
KagomeSelf-contained Japanese Morphological Analyzer written in pure Go
Stars: ✭ 554 (+252.87%)
YomichanJapanese pop-up dictionary extension for Chrome and Firefox.
Stars: ✭ 464 (+195.54%)
Dan Jurafsky Chris Manning NlpMy solution to the Natural Language Processing course made by Dan Jurafsky, Chris Manning in Winter 2012.
Stars: ✭ 124 (-21.02%)
Diff Match PatchDiff Match Patch is a high-performance library in multiple languages that manipulates plain text.
Stars: ✭ 4,910 (+3027.39%)
NostrilNostril: Nonsense String Evaluator
Stars: ✭ 86 (-45.22%)
EkphrasisEkphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
Stars: ✭ 433 (+175.8%)
Kanji KoohiiA web application to help Japanese language learners remember the kanji.
Stars: ✭ 137 (-12.74%)
Aho CorasickA fast implementation of Aho-Corasick in Rust.
Stars: ✭ 424 (+170.06%)
Node RakeA NodeJS implementation of the Rapid Automatic Keyword Extraction algorithm.
Stars: ✭ 85 (-45.86%)
Artificial Adversary🗣️ Tool to generate adversarial text examples and test machine learning models against them
Stars: ✭ 348 (+121.66%)
Japanese.jsUtil collection for Japanese text processing. Hiraganize, Katakanize, and Romanize.
Stars: ✭ 150 (-4.46%)
XiocExtract indicators of compromise from text, including "escaped" ones.
Stars: ✭ 148 (-5.73%)
TmtoolkitText Mining and Topic Modeling Toolkit for Python with parallel processing power
Stars: ✭ 135 (-14.01%)
ScattertextBeautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (+996.82%)
VirastarCleaning-up Persian Texts!
Stars: ✭ 77 (-50.96%)