All Projects → WeTextProcessing → Similar Projects or Alternatives

144 Open source projects that are alternatives of or similar to WeTextProcessing

TextDatasetCleaner
🔬 Очистка датасетов от мусора (нормализация, препроцессинг)
Stars: ✭ 27 (-87.32%)
Mutual labels:  text-processing, normalization
Dan Jurafsky Chris Manning Nlp
My solution to the Natural Language Processing course made by Dan Jurafsky, Chris Manning in Winter 2012.
Stars: ✭ 124 (-41.78%)
Mutual labels:  text-processing
Javascript Text Expander
Expands texts as you type, naturally
Stars: ✭ 58 (-72.77%)
Mutual labels:  text-processing
Text Mining
Text Mining in Python
Stars: ✭ 18 (-91.55%)
Mutual labels:  text-processing
Unix Text Commands
Unix Text Processing Command Reference
Stars: ✭ 78 (-63.38%)
Mutual labels:  text-processing
Prenlp
Preprocessing Library for Natural Language Processing
Stars: ✭ 130 (-38.97%)
Mutual labels:  text-processing
Pyparsing
Python library for creating PEG parsers
Stars: ✭ 1,052 (+393.9%)
Mutual labels:  text-processing
Textvec
Text vectorization tool to outperform TFIDF for classification tasks
Stars: ✭ 167 (-21.6%)
Mutual labels:  text-processing
Command Line Text Processing
⚡ From finding text to search and replace, from sorting to beautifying text and more 🎨
Stars: ✭ 9,771 (+4487.32%)
Mutual labels:  text-processing
Python Nameparser
A simple Python module for parsing human names into their individual components
Stars: ✭ 462 (+116.9%)
Mutual labels:  text-processing
Pynlpl
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
Stars: ✭ 426 (+100%)
Mutual labels:  text-processing
Node Rake
A NodeJS implementation of the Rapid Automatic Keyword Extraction algorithm.
Stars: ✭ 85 (-60.09%)
Mutual labels:  text-processing
Stanza Old
Stanford NLP group's shared Python tools.
Stars: ✭ 142 (-33.33%)
Mutual labels:  text-processing
Ter
Text Expression Runner – Readable and easy to use text expressions
Stars: ✭ 67 (-68.54%)
Mutual labels:  text-processing
Fastnlp
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Stars: ✭ 2,441 (+1046.01%)
Mutual labels:  text-processing
Pipeit
PipeIt is a text transformation, conversion, cleansing and extraction tool.
Stars: ✭ 57 (-73.24%)
Mutual labels:  text-processing
Libasciidoc
A Golang library for processing Asciidoc files.
Stars: ✭ 129 (-39.44%)
Mutual labels:  text-processing
Fxt
A large scale feature extraction tool for text-based machine learning
Stars: ✭ 25 (-88.26%)
Mutual labels:  text-processing
Regex Automata
A low level regular expression library that uses deterministic finite automata.
Stars: ✭ 203 (-4.69%)
Mutual labels:  text-processing
Gohn
Hatena Notation (はてな記法) Parser written in Go
Stars: ✭ 17 (-92.02%)
Mutual labels:  text-processing
Textcluster
短文本聚类预处理模块 Short text cluster
Stars: ✭ 115 (-46.01%)
Mutual labels:  text-processing
Open Korean Text
Open Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (+105.63%)
Mutual labels:  text-processing
Jaconv
Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku and Zenkaku
Stars: ✭ 157 (-26.29%)
Mutual labels:  text-processing
Text classification
Text Classification Algorithms: A Survey
Stars: ✭ 1,276 (+499.06%)
Mutual labels:  text-processing
Bsed
Simple SQL-like syntax on top of Perl text processing.
Stars: ✭ 414 (+94.37%)
Mutual labels:  text-processing
Textpipe
Textpipe: clean and extract metadata from text
Stars: ✭ 284 (+33.33%)
Mutual labels:  text-processing
Ios11 Visionframework
Vision Framework IOS WWDC 2017
Stars: ✭ 85 (-60.09%)
Mutual labels:  text-processing
Browsecloud
A web app to create and browse text visualizations for automated customer listening.
Stars: ✭ 143 (-32.86%)
Mutual labels:  text-processing
Kefirbb
A flexible Java text processor. BB, BBCode, BB-code, HTML, Textile, Markdown, parser, translator, converter.
Stars: ✭ 83 (-61.03%)
Mutual labels:  text-processing
Sd
Intuitive find & replace CLI (sed alternative)
Stars: ✭ 2,755 (+1193.43%)
Mutual labels:  text-processing
Virastar
Cleaning-up Persian Texts!
Stars: ✭ 77 (-63.85%)
Mutual labels:  text-processing
Tmtoolkit
Text Mining and Topic Modeling Toolkit for Python with parallel processing power
Stars: ✭ 135 (-36.62%)
Mutual labels:  text-processing
Applied Text Mining In Python
Repo for Applied Text Mining in Python (coursera) by University of Michigan
Stars: ✭ 59 (-72.3%)
Mutual labels:  text-processing
Stringi
THE String Processing Package for R (with ICU)
Stars: ✭ 204 (-4.23%)
Mutual labels:  text-processing
Go Search Replace
🚀 Search & replace URLs in WordPress SQL files.
Stars: ✭ 57 (-73.24%)
Mutual labels:  text-processing
Konoha
🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.
Stars: ✭ 130 (-38.97%)
Mutual labels:  text-processing
Lingua Franca
Mycroft's multilingual text parsing and formatting library
Stars: ✭ 51 (-76.06%)
Mutual labels:  text-processing
Text Detector
Tool which allow you to detect and translate text.
Stars: ✭ 173 (-18.78%)
Mutual labels:  text-processing
Qp Trie Rs
An idiomatic and fast QP-trie implementation in pure Rust.
Stars: ✭ 47 (-77.93%)
Mutual labels:  text-processing
Padatious
A neural network intent parser
Stars: ✭ 124 (-41.78%)
Mutual labels:  text-processing
Concise Ipython Notebooks For Deep Learning
Ipython Notebooks for solving problems like classification, segmentation, generation using latest Deep learning algorithms on different publicly available text and image data-sets.
Stars: ✭ 23 (-89.2%)
Mutual labels:  text-processing
rake-rs
Multilingual implementation of RAKE algorithm for Rust
Stars: ✭ 30 (-85.92%)
Mutual labels:  text-processing
Chr
🔤 Lightweight R package for manipulating [string] characters
Stars: ✭ 18 (-91.55%)
Mutual labels:  text-processing
Cogcomp Nlpy
CogComp's light-weight Python NLP annotators
Stars: ✭ 115 (-46.01%)
Mutual labels:  text-processing
Whatlanggo
Natural language detection library for Go
Stars: ✭ 479 (+124.88%)
Mutual labels:  text-processing
Nlpre
Python library for Natural Language Preprocessing (NLPre)
Stars: ✭ 158 (-25.82%)
Mutual labels:  text-processing
Diff Match Patch
Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.
Stars: ✭ 4,910 (+2205.16%)
Mutual labels:  text-processing
Colibri Core
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.
Stars: ✭ 112 (-47.42%)
Mutual labels:  text-processing
Ekphrasis
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
Stars: ✭ 433 (+103.29%)
Mutual labels:  text-processing
Rust Unic
UNIC: Unicode and Internationalization Crates for Rust
Stars: ✭ 189 (-11.27%)
Mutual labels:  text-processing
Aho Corasick
A fast implementation of Aho-Corasick in Rust.
Stars: ✭ 424 (+99.06%)
Mutual labels:  text-processing
Bpl
Binary Processing Language
Stars: ✭ 103 (-51.64%)
Mutual labels:  text-processing
Artificial Adversary
🗣️ Tool to generate adversarial text examples and test machine learning models against them
Stars: ✭ 348 (+63.38%)
Mutual labels:  text-processing
Japanese.js
Util collection for Japanese text processing. Hiraganize, Katakanize, and Romanize.
Stars: ✭ 150 (-29.58%)
Mutual labels:  text-processing
ArabicProcessingCog
A Python package that do stemming, tokenization, sentence breaking, segmentation, normalization, POS tagging for Arabic language.
Stars: ✭ 19 (-91.08%)
Mutual labels:  text-processing
Mtp
Multi-lingual Text Processing
Stars: ✭ 87 (-59.15%)
Mutual labels:  text-processing
text-analysis
Weaving analytical stories from text data
Stars: ✭ 12 (-94.37%)
Mutual labels:  text-processing
twitter-text-python
Twitter Text Libraries for Python
Stars: ✭ 22 (-89.67%)
Mutual labels:  text-processing
Pyarabic
pyarabic
Stars: ✭ 183 (-14.08%)
Mutual labels:  text-processing
Xioc
Extract indicators of compromise from text, including "escaped" ones.
Stars: ✭ 148 (-30.52%)
Mutual labels:  text-processing
1-60 of 144 similar projects