learn perl onelinersExample based guide for text processing with perl from the command line
Stars: ✭ 63 (-38.83%)
s3-utilsUtilities and tools based around Amazon S3 to provide convenience APIs in a CLI
Stars: ✭ 45 (-56.31%)
vi-rsVietnamese Input Method library
Stars: ✭ 69 (-33.01%)
python-mecabA repository to bind mecab for Python 3.5+. Not using swig nor pybind. (Not Maintained Now)
Stars: ✭ 27 (-73.79%)
fuzzychineseA small package to fuzzy match chinese words
Stars: ✭ 50 (-51.46%)
Emotion-recognition-from-tweetsA comprehensive approach on recognizing emotion (sentiment) from a certain tweet. Supervised machine learning.
Stars: ✭ 17 (-83.5%)
teanaps자연어 처리와 텍스트 분석을 위한 오픈소스 파이썬 라이브러리 입니다.
Stars: ✭ 91 (-11.65%)
text2videoText to Video Generation Problem
Stars: ✭ 28 (-72.82%)
s3-concatConcatenate Amazon S3 files remotely using flexible patterns
Stars: ✭ 32 (-68.93%)
estrattoparsing fixed width files content made easy
Stars: ✭ 12 (-88.35%)
dif'dif' is a Linux preprocessing front end to gvimdiff/meld/kompare
Stars: ✭ 18 (-82.52%)
SuperCombinators[Deprecated] A Swift parser combinator framework
Stars: ✭ 19 (-81.55%)
synsyn - the thesaurus
Stars: ✭ 45 (-56.31%)
frangipanniProgram to convert lines of text into a tree structure.
Stars: ✭ 1,176 (+1041.75%)
Text-Classification-LSTMs-PyTorchThe aim of this repository is to show a baseline model for text classification by implementing a LSTM-based model coded in PyTorch. In order to provide a better understanding of the model, it will be used a Tweets dataset provided by Kaggle.
Stars: ✭ 45 (-56.31%)
ConTextoLibrería en Python para minería de texto y NLP
Stars: ✭ 43 (-58.25%)
perkeA keyphrase extractor for Persian
Stars: ✭ 60 (-41.75%)
sliceslice-rsA fast implementation of single-pattern substring search using SIMD acceleration.
Stars: ✭ 66 (-35.92%)
r4stringsHandling Strings in R
Stars: ✭ 39 (-62.14%)
WeTextProcessingText Normalization & Inverse Text Normalization
Stars: ✭ 213 (+106.8%)
text-analysisWeaving analytical stories from text data
Stars: ✭ 12 (-88.35%)
rake-rsMultilingual implementation of RAKE algorithm for Rust
Stars: ✭ 30 (-70.87%)
StringiTHE String Processing Package for R (with ICU)
Stars: ✭ 204 (+98.06%)
Regex AutomataA low level regular expression library that uses deterministic finite automata.
Stars: ✭ 203 (+97.09%)
Rust UnicUNIC: Unicode and Internationalization Crates for Rust
Stars: ✭ 189 (+83.5%)
SdIntuitive find & replace CLI (sed alternative)
Stars: ✭ 2,755 (+2574.76%)
FastnlpfastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Stars: ✭ 2,441 (+2269.9%)
Text DetectorTool which allow you to detect and translate text.
Stars: ✭ 173 (+67.96%)
TextvecText vectorization tool to outperform TFIDF for classification tasks
Stars: ✭ 167 (+62.14%)
NlprePython library for Natural Language Preprocessing (NLPre)
Stars: ✭ 158 (+53.4%)
JaconvPure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku and Zenkaku
Stars: ✭ 157 (+52.43%)
Japanese.jsUtil collection for Japanese text processing. Hiraganize, Katakanize, and Romanize.
Stars: ✭ 150 (+45.63%)
XiocExtract indicators of compromise from text, including "escaped" ones.
Stars: ✭ 148 (+43.69%)
BrowsecloudA web app to create and browse text visualizations for automated customer listening.
Stars: ✭ 143 (+38.83%)
Stanza OldStanford NLP group's shared Python tools.
Stars: ✭ 142 (+37.86%)
TmtoolkitText Mining and Topic Modeling Toolkit for Python with parallel processing power
Stars: ✭ 135 (+31.07%)
PrenlpPreprocessing Library for Natural Language Processing
Stars: ✭ 130 (+26.21%)
Konoha🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.
Stars: ✭ 130 (+26.21%)
LibasciidocA Golang library for processing Asciidoc files.
Stars: ✭ 129 (+25.24%)
PadatiousA neural network intent parser
Stars: ✭ 124 (+20.39%)
Dan Jurafsky Chris Manning NlpMy solution to the Natural Language Processing course made by Dan Jurafsky, Chris Manning in Winter 2012.
Stars: ✭ 124 (+20.39%)
Cogcomp NlpyCogComp's light-weight Python NLP annotators
Stars: ✭ 115 (+11.65%)
Textcluster短文本聚类预处理模块 Short text cluster
Stars: ✭ 115 (+11.65%)
Colibri CoreColibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.
Stars: ✭ 112 (+8.74%)
Command Line Text Processing⚡ From finding text to search and replace, from sorting to beautifying text and more 🎨
Stars: ✭ 9,771 (+9386.41%)