ConTextoLibrería en Python para minería de texto y NLP
Stars: ✭ 43 (-4.44%)
BplBinary Processing Language
Stars: ✭ 103 (+128.89%)
leximavenA command line tool for searching word-related APIs.
Stars: ✭ 20 (-55.56%)
MtpMulti-lingual Text Processing
Stars: ✭ 87 (+93.33%)
PLNmodelsA collection of Poisson lognormal models for multivariate count data analysis
Stars: ✭ 44 (-2.22%)
poet-assistantAndroid app with rhyming dictionary, thesaurus, and dictionary, with text-to-speech functionality to read your poem.
Stars: ✭ 64 (+42.22%)
KefirbbA flexible Java text processor. BB, BBCode, BB-code, HTML, Textile, Markdown, parser, translator, converter.
Stars: ✭ 83 (+84.44%)
sliceslice-rsA fast implementation of single-pattern substring search using SIMD acceleration.
Stars: ✭ 66 (+46.67%)
VirastarCleaning-up Persian Texts!
Stars: ✭ 77 (+71.11%)
MeilisearchPowerful, fast, and an easy to use search engine
Stars: ✭ 20,236 (+44868.89%)
LuminescenceDevelopment of the R package 'Luminescence'
Stars: ✭ 13 (-71.11%)
JaconvPure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku and Zenkaku
Stars: ✭ 157 (+248.89%)
typ3r.js🍟 [Library] dA aNn0Y1Ng t3Xt g3NeRa7or
Stars: ✭ 22 (-51.11%)
Go Search Replace🚀 Search & replace URLs in WordPress SQL files.
Stars: ✭ 57 (+26.67%)
Synonyms🌿 中文近义词:聊天机器人,智能问答工具包
Stars: ✭ 4,027 (+8848.89%)
Lingua FrancaMycroft's multilingual text parsing and formatting library
Stars: ✭ 51 (+13.33%)
opencvR bindings for OpenCV
Stars: ✭ 123 (+173.33%)
Qp Trie RsAn idiomatic and fast QP-trie implementation in pure Rust.
Stars: ✭ 47 (+4.44%)
Concise Ipython Notebooks For Deep LearningIpython Notebooks for solving problems like classification, segmentation, generation using latest Deep learning algorithms on different publicly available text and image data-sets.
Stars: ✭ 23 (-48.89%)
geodaDataData package for accessing GeoDa datasets using R
Stars: ✭ 15 (-66.67%)
StringiTHE String Processing Package for R (with ICU)
Stars: ✭ 204 (+353.33%)
WhatlanggoNatural language detection library for Go
Stars: ✭ 479 (+964.44%)
WeTextProcessingText Normalization & Inverse Text Normalization
Stars: ✭ 213 (+373.33%)
Diff Match PatchDiff Match Patch is a high-performance library in multiple languages that manipulates plain text.
Stars: ✭ 4,910 (+10811.11%)
Rust UnicUNIC: Unicode and Internationalization Crates for Rust
Stars: ✭ 189 (+320%)
EkphrasisEkphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
Stars: ✭ 433 (+862.22%)
statesCreate country-year/month/day panels consistent with the COW or Gleditsch & Ward independent states lists
Stars: ✭ 13 (-71.11%)
Aho CorasickA fast implementation of Aho-Corasick in Rust.
Stars: ✭ 424 (+842.22%)
SdIntuitive find & replace CLI (sed alternative)
Stars: ✭ 2,755 (+6022.22%)
Artificial Adversary🗣️ Tool to generate adversarial text examples and test machine learning models against them
Stars: ✭ 348 (+673.33%)
rake-rsMultilingual implementation of RAKE algorithm for Rust
Stars: ✭ 30 (-33.33%)
ArabicProcessingCogA Python package that do stemming, tokenization, sentence breaking, segmentation, normalization, POS tagging for Arabic language.
Stars: ✭ 19 (-57.78%)
Text DetectorTool which allow you to detect and translate text.
Stars: ✭ 173 (+284.44%)
Text-AnalysisExplaining textual analysis tools in Python. Including Preprocessing, Skip Gram (word2vec), and Topic Modelling.
Stars: ✭ 48 (+6.67%)
rdomainsClassifying the content of domains
Stars: ✭ 47 (+4.44%)
support-tickets-classificationThis case study shows how to create a model for text analysis and classification and deploy it as a web service in Azure cloud in order to automatically classify support tickets. This project is a proof of concept made by Microsoft (Commercial Software Engineering team) in collaboration with Endava http://endava.com/en
Stars: ✭ 142 (+215.56%)
NlprePython library for Natural Language Preprocessing (NLPre)
Stars: ✭ 158 (+251.11%)
textQiniu Text Processing Libraries for Go
Stars: ✭ 25 (-44.44%)
cricketdataInternational cricket data for men and women, Tests, ODIs and T20s
Stars: ✭ 66 (+46.67%)
Japanese.jsUtil collection for Japanese text processing. Hiraganize, Katakanize, and Romanize.
Stars: ✭ 150 (+233.33%)
hckA sharp cut(1) clone.
Stars: ✭ 542 (+1104.44%)
stringxDrop-in replacements for base R string functions powered by stringi
Stars: ✭ 14 (-68.89%)
suppdataGrabbing SUPPlementary DATA in R
Stars: ✭ 31 (-31.11%)
DoReMIFaSolTéléchargement des données sur le site de l'Insee
Stars: ✭ 25 (-44.44%)
synonym-extractorExtract synonyms, keywords from sentences using modified implementation of Aho Corasick algorithm
Stars: ✭ 38 (-15.56%)
XiocExtract indicators of compromise from text, including "escaped" ones.
Stars: ✭ 148 (+228.89%)
TextDatasetCleaner🔬 Очистка датасетов от мусора (нормализация, препроцессинг)
Stars: ✭ 27 (-40%)
andaluh-jsTransliterate español (spanish) spelling to andaluz proposals using javascript
Stars: ✭ 22 (-51.11%)
BrowsecloudA web app to create and browse text visualizations for automated customer listening.
Stars: ✭ 143 (+217.78%)
pwsh-preludePowerShell “standard” library for supercharging your productivity. Provides a powerful cross-platform scripting environment enabling efficient analysis and sustainable science in myriad contexts.
Stars: ✭ 26 (-42.22%)
HrEasy Access to Uppercase H
Stars: ✭ 56 (+24.44%)
Stanza OldStanford NLP group's shared Python tools.
Stars: ✭ 142 (+215.56%)
Compare-UserJSPowerShell script for comparing user.js (or prefs.js) files.
Stars: ✭ 79 (+75.56%)
TRUNAJOD2.0An easy-to-use library to extract indices from texts.
Stars: ✭ 18 (-60%)
TmtoolkitText Mining and Topic Modeling Toolkit for Python with parallel processing power
Stars: ✭ 135 (+200%)
lingua-go👄 The most accurate natural language detection library for Go, suitable for long and short text alike
Stars: ✭ 684 (+1420%)
cinjeA Pythonic and ultra fast template engine DSL.
Stars: ✭ 26 (-42.22%)