ConTextoLibrería en Python para minería de texto y NLP
Stars: ✭ 43 (+95.45%)
StringiTHE String Processing Package for R (with ICU)
Stars: ✭ 204 (+827.27%)
learn perl onelinersExample based guide for text processing with perl from the command line
Stars: ✭ 63 (+186.36%)
SuperCombinators[Deprecated] A Swift parser combinator framework
Stars: ✭ 19 (-13.64%)
NlprePython library for Natural Language Preprocessing (NLPre)
Stars: ✭ 158 (+618.18%)
frogFrog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.
Stars: ✭ 70 (+218.18%)
WeTextProcessingText Normalization & Inverse Text Normalization
Stars: ✭ 213 (+868.18%)
cinjeA Pythonic and ultra fast template engine DSL.
Stars: ✭ 26 (+18.18%)
SdIntuitive find & replace CLI (sed alternative)
Stars: ✭ 2,755 (+12422.73%)
fuzzychineseA small package to fuzzy match chinese words
Stars: ✭ 50 (+127.27%)
dif'dif' is a Linux preprocessing front end to gvimdiff/meld/kompare
Stars: ✭ 18 (-18.18%)
BrowsecloudA web app to create and browse text visualizations for automated customer listening.
Stars: ✭ 143 (+550%)
SuffixTreeOptimized implementation of suffix tree in python using Ukkonen's algorithm.
Stars: ✭ 38 (+72.73%)
frangipanniProgram to convert lines of text into a tree structure.
Stars: ✭ 1,176 (+5245.45%)
TRUNAJOD2.0An easy-to-use library to extract indices from texts.
Stars: ✭ 18 (-18.18%)
sliceslice-rsA fast implementation of single-pattern substring search using SIMD acceleration.
Stars: ✭ 66 (+200%)
deduceDeduce: de-identification method for Dutch medical text
Stars: ✭ 40 (+81.82%)
rake-rsMultilingual implementation of RAKE algorithm for Rust
Stars: ✭ 30 (+36.36%)
andaluh-jsTransliterate español (spanish) spelling to andaluz proposals using javascript
Stars: ✭ 22 (+0%)
Rust UnicUNIC: Unicode and Internationalization Crates for Rust
Stars: ✭ 189 (+759.09%)
vi-rsVietnamese Input Method library
Stars: ✭ 69 (+213.64%)
Text DetectorTool which allow you to detect and translate text.
Stars: ✭ 173 (+686.36%)
TextrudeCode generation from YAML/JSON/CSV models via SCRIBAN templates
Stars: ✭ 79 (+259.09%)
Japanese.jsUtil collection for Japanese text processing. Hiraganize, Katakanize, and Romanize.
Stars: ✭ 150 (+581.82%)
teanaps자연어 처리와 텍스트 분석을 위한 오픈소스 파이썬 라이브러리 입니다.
Stars: ✭ 91 (+313.64%)
estrattoparsing fixed width files content made easy
Stars: ✭ 12 (-45.45%)
Stanza OldStanford NLP group's shared Python tools.
Stars: ✭ 142 (+545.45%)
nlcliNatural language interface for the command line.
Stars: ✭ 21 (-4.55%)
Compare-UserJSPowerShell script for comparing user.js (or prefs.js) files.
Stars: ✭ 79 (+259.09%)
synsyn - the thesaurus
Stars: ✭ 45 (+104.55%)
finglishA Finglish to Persian converter.
Stars: ✭ 60 (+172.73%)
Text-Classification-LSTMs-PyTorchThe aim of this repository is to show a baseline model for text classification by implementing a LSTM-based model coded in PyTorch. In order to provide a better understanding of the model, it will be used a Tweets dataset provided by Kaggle.
Stars: ✭ 45 (+104.55%)
TextDatasetCleaner🔬 Очистка датасетов от мусора (нормализация, препроцессинг)
Stars: ✭ 27 (+22.73%)
perkeA keyphrase extractor for Persian
Stars: ✭ 60 (+172.73%)
sova-tts-tpsNLP-preprocessor for the SOVA-TTS project
Stars: ✭ 44 (+100%)
r4stringsHandling Strings in R
Stars: ✭ 39 (+77.27%)
lingua-go👄 The most accurate natural language detection library for Go, suitable for long and short text alike
Stars: ✭ 684 (+3009.09%)
text-analysisWeaving analytical stories from text data
Stars: ✭ 12 (-45.45%)
hckA sharp cut(1) clone.
Stars: ✭ 542 (+2363.64%)
Regex AutomataA low level regular expression library that uses deterministic finite automata.
Stars: ✭ 203 (+822.73%)
s3-utilsUtilities and tools based around Amazon S3 to provide convenience APIs in a CLI
Stars: ✭ 45 (+104.55%)
Pyarabicpyarabic
Stars: ✭ 183 (+731.82%)
textstatRuby gem to calculate statistics from text to determine readability, complexity and grade level of a particular corpus.
Stars: ✭ 25 (+13.64%)
FastnlpfastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Stars: ✭ 2,441 (+10995.45%)
python-mecabA repository to bind mecab for Python 3.5+. Not using swig nor pybind. (Not Maintained Now)
Stars: ✭ 27 (+22.73%)
TextvecText vectorization tool to outperform TFIDF for classification tasks
Stars: ✭ 167 (+659.09%)
pwsh-preludePowerShell “standard” library for supercharging your productivity. Provides a powerful cross-platform scripting environment enabling efficient analysis and sustainable science in myriad contexts.
Stars: ✭ 26 (+18.18%)
JaconvPure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku and Zenkaku
Stars: ✭ 157 (+613.64%)
Emotion-recognition-from-tweetsA comprehensive approach on recognizing emotion (sentiment) from a certain tweet. Supervised machine learning.
Stars: ✭ 17 (-22.73%)
XiocExtract indicators of compromise from text, including "escaped" ones.
Stars: ✭ 148 (+572.73%)
hama-py🦛 파이썬 한글 처리 라이브러리. Python Korean Morphological Analyzer
Stars: ✭ 16 (-27.27%)
text2videoText to Video Generation Problem
Stars: ✭ 28 (+27.27%)
stringxDrop-in replacements for base R string functions powered by stringi
Stars: ✭ 14 (-36.36%)
HrEasy Access to Uppercase H
Stars: ✭ 56 (+154.55%)
s3-concatConcatenate Amazon S3 files remotely using flexible patterns
Stars: ✭ 32 (+45.45%)