typ3r.js
🍟 [Library] dA aNn0Y1Ng t3Xt g3NeRa7or
Stars: ✭ 22 (-78.64%)
Compare-UserJS
PowerShell script for comparing user.js (or prefs.js) files.
Stars: ✭ 79 (-23.3%)
Python Nameparser
A simple Python module for parsing human names into their individual components
Stars: ✭ 462 (+348.54%)
NLP-tools
Useful Python NLP tools (evaluation, GUI, tokenization)
Stars: ✭ 39 (-62.14%)
nlcli
Natural language interface for the command line.
Stars: ✭ 21 (-79.61%)
TextDatasetCleaner
🔬 Cleaning datasets of junk (normalization, preprocessing)
Stars: ✭ 27 (-73.79%)
textstat
Ruby gem to calculate statistics from text to determine readability, complexity and grade level of a particular corpus.
Stars: ✭ 25 (-75.73%)
Pynlpl
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language models. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP-specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
Stars: ✭ 426 (+313.59%)
daachorse
🐎 A fast implementation of the Aho-Corasick algorithm using the compact double-array data structure.
Stars: ✭ 75 (-27.18%)
sova-tts-tps
NLP-preprocessor for the SOVA-TTS project
Stars: ✭ 44 (-57.28%)
Fxt
A large scale feature extraction tool for text-based machine learning
Stars: ✭ 25 (-75.73%)
Ter
Text Expression Runner – Readable and easy to use text expressions
Stars: ✭ 67 (-34.95%)
hck
A sharp cut(1) clone.
Stars: ✭ 542 (+426.21%)
Gohn
Hatena Notation (はてな記法) Parser written in Go
Stars: ✭ 17 (-83.5%)
pwsh-prelude
PowerShell “standard” library for supercharging your productivity. Provides a powerful cross-platform scripting environment enabling efficient analysis and sustainable science in myriad contexts.
Stars: ✭ 26 (-74.76%)
Node Rake
A NodeJS implementation of the Rapid Automatic Keyword Extraction algorithm.
Stars: ✭ 85 (-17.48%)
lingua-go
👄 The most accurate natural language detection library for Go, suitable for long and short text alike
Stars: ✭ 684 (+564.08%)
Open Korean Text
Open Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (+325.24%)
hama-py
🦛 Python library for Hangul (Korean) processing. Python Korean Morphological Analyzer
Stars: ✭ 16 (-84.47%)
Pipeit
PipeIt is a text transformation, conversion, cleansing and extraction tool.
Stars: ✭ 57 (-44.66%)
finglish
A Finglish to Persian converter.
Stars: ✭ 60 (-41.75%)
Bsed
Simple SQL-like syntax on top of Perl text processing.
Stars: ✭ 414 (+301.94%)
ArabicProcessingCog
A Python package that does stemming, tokenization, sentence breaking, segmentation, normalization, and POS tagging for the Arabic language.
Stars: ✭ 19 (-81.55%)
deduce
Deduce: de-identification method for Dutch medical text
Stars: ✭ 40 (-61.17%)
Qp Trie Rs
An idiomatic and fast QP-trie implementation in pure Rust.
Stars: ✭ 47 (-54.37%)
Text-Analysis
Explaining textual analysis tools in Python, including preprocessing, skip-gram (word2vec), and topic modelling.
Stars: ✭ 48 (-53.4%)
Virastar
Cleaning-up Persian Texts!
Stars: ✭ 77 (-25.24%)
support-tickets-classification
This case study shows how to create a model for text analysis and classification and deploy it as a web service in Azure cloud in order to automatically classify support tickets. This project is a proof of concept made by Microsoft (Commercial Software Engineering team) in collaboration with Endava http://endava.com/en
Stars: ✭ 142 (+37.86%)
Concise Ipython Notebooks For Deep Learning
IPython notebooks for solving problems like classification, segmentation, and generation using the latest deep learning algorithms on different publicly available text and image datasets.
Stars: ✭ 23 (-77.67%)
text
Qiniu Text Processing Libraries for Go
Stars: ✭ 25 (-75.73%)
Chr
🔤 Lightweight R package for manipulating [string] characters
Stars: ✭ 18 (-82.52%)
stringx
Drop-in replacements for base R string functions powered by stringi
Stars: ✭ 14 (-86.41%)
andaluh-js
Transliterate español (Spanish) spelling to Andaluz proposals using JavaScript
Stars: ✭ 22 (-78.64%)
Whatlanggo
Natural language detection library for Go
Stars: ✭ 479 (+365.05%)
Hr
Easy Access to Uppercase H
Stars: ✭ 56 (-45.63%)
Mtp
Multi-lingual Text Processing
Stars: ✭ 87 (-15.53%)
TRUNAJOD2.0
An easy-to-use library to extract indices from texts.
Stars: ✭ 18 (-82.52%)
Diff Match Patch
Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.
Stars: ✭ 4,910 (+4666.99%)
cinje
A Pythonic and ultra fast template engine DSL.
Stars: ✭ 26 (-74.76%)
Go Search Replace
🚀 Search & replace URLs in WordPress SQL files.
Stars: ✭ 57 (-44.66%)
Textrude
Code generation from YAML/JSON/CSV models via SCRIBAN templates
Stars: ✭ 79 (-23.3%)
Ekphrasis
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from two large corpora (English Wikipedia and Twitter, 330 million English tweets).
Stars: ✭ 433 (+320.39%)
Kefirbb
A flexible Java text processor. BB, BBCode, BB-code, HTML, Textile, Markdown, parser, translator, converter.
Stars: ✭ 83 (-19.42%)
SuffixTree
Optimized implementation of a suffix tree in Python using Ukkonen's algorithm.
Stars: ✭ 38 (-63.11%)
Aho Corasick
A fast implementation of Aho-Corasick in Rust.
Stars: ✭ 424 (+311.65%)
frog
Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.
Stars: ✭ 70 (-32.04%)
Lingua Franca
Mycroft's multilingual text parsing and formatting library
Stars: ✭ 51 (-50.49%)
Artificial Adversary
🗣️ Tool to generate adversarial text examples and test machine learning models against them
Stars: ✭ 348 (+237.86%)
Nostril
Nostril: Nonsense String Evaluator
Stars: ✭ 86 (-16.5%)
Pyparsing
Python library for creating PEG parsers
Stars: ✭ 1,052 (+921.36%)
Textpipe
Textpipe: clean and extract metadata from text
Stars: ✭ 284 (+175.73%)