All Projects → TextDatasetCleaner → Similar Projects or Alternatives

385 Open source projects that are alternatives of or similar to TextDatasetCleaner

class-norm
Class Normalization for Continual Zero-Shot Learning
Stars: ✭ 34 (+25.93%)
Mutual labels:  normalization
Text Detector
Tool which allow you to detect and translate text.
Stars: ✭ 173 (+540.74%)
Mutual labels:  text-processing
lameta
The Metadata Editor for Transparent Archiving of language document materials
Stars: ✭ 18 (-33.33%)
Mutual labels:  linguistics
Nlpre
Python library for Natural Language Preprocessing (NLPre)
Stars: ✭ 158 (+485.19%)
Mutual labels:  text-processing
readability
Fast readability scores for text data
Stars: ✭ 22 (-18.52%)
Mutual labels:  text-mining
knime-textprocessing
KNIME - Text Processing Extension (Labs)
Stars: ✭ 17 (-37.04%)
Mutual labels:  text-processing
lambda-notebook
Lambda Notebook: Formal Semantics in Jupyter
Stars: ✭ 16 (-40.74%)
Mutual labels:  linguistics
aera-workshop
This workshop introduces participants to the Learning Analytics (LA), and provides a brief overview of LA methodologies, literature, applications, and ethical issues as they relate to STEM education.
Stars: ✭ 14 (-48.15%)
Mutual labels:  text-mining
Stanza Old
Stanford NLP group's shared Python tools.
Stars: ✭ 142 (+425.93%)
Mutual labels:  text-processing
WonderfulPolishLanguage
This is a repository created for the list of resources for learning and exploring Wonderful Polish language.
Stars: ✭ 31 (+14.81%)
Mutual labels:  linguistics
Prenlp
Preprocessing Library for Natural Language Processing
Stars: ✭ 130 (+381.48%)
Mutual labels:  text-processing
sparklanes
A lightweight data processing framework for Apache Spark
Stars: ✭ 17 (-37.04%)
Mutual labels:  preprocessing
Libasciidoc
A Golang library for processing Asciidoc files.
Stars: ✭ 129 (+377.78%)
Mutual labels:  text-processing
Dan Jurafsky Chris Manning Nlp
My solution to the Natural Language Processing course made by Dan Jurafsky, Chris Manning in Winter 2012.
Stars: ✭ 124 (+359.26%)
Mutual labels:  text-processing
RainNet
[CVPR 2021] Region-aware Adaptive Instance Normalization for Image Harmonization
Stars: ✭ 125 (+362.96%)
Mutual labels:  normalization
Bpl
Binary Processing Language
Stars: ✭ 103 (+281.48%)
Mutual labels:  text-processing
twitter-text-python
Twitter Text Libraries for Python
Stars: ✭ 22 (-18.52%)
Mutual labels:  text-processing
Mtp
Multi-lingual Text Processing
Stars: ✭ 87 (+222.22%)
Mutual labels:  text-processing
Textrude
Code generation from YAML/JSON/CSV models via SCRIBAN templates
Stars: ✭ 79 (+192.59%)
Mutual labels:  text-processing
Ios11 Visionframework
Vision Framework IOS WWDC 2017
Stars: ✭ 85 (+214.81%)
Mutual labels:  text-processing
Awesome-CyberSec-Resources
An awesome collection of curated Cyber Security resources(Books, Tutorials, Blogs, Podcasts, ...)
Stars: ✭ 273 (+911.11%)
Mutual labels:  hactoberfest2021
Kefirbb
A flexible Java text processor. BB, BBCode, BB-code, HTML, Textile, Markdown, parser, translator, converter.
Stars: ✭ 83 (+207.41%)
Mutual labels:  text-processing
malay-dataset
Text corpus for Bahasa Malaysia, https://malaya.readthedocs.io/en/latest/Dataset.html
Stars: ✭ 189 (+600%)
Mutual labels:  text-mining
Virastar
Cleaning-up Persian Texts!
Stars: ✭ 77 (+185.19%)
Mutual labels:  text-processing
Gwu data mining
Materials for GWU DNSC 6279 and DNSC 6290.
Stars: ✭ 217 (+703.7%)
Mutual labels:  text-mining
Javascript Text Expander
Expands texts as you type, naturally
Stars: ✭ 58 (+114.81%)
Mutual labels:  text-processing
named-entity-recognition
Notebooks for teaching Named Entity Recognition at the Cultural Heritage Data School, run by Cambridge Digital Humanities
Stars: ✭ 18 (-33.33%)
Mutual labels:  text-mining
Lingua Franca
Mycroft's multilingual text parsing and formatting library
Stars: ✭ 51 (+88.89%)
Mutual labels:  text-processing
Qminer
Analytic platform for real-time large-scale streams containing structured and unstructured data.
Stars: ✭ 206 (+662.96%)
Mutual labels:  text-mining
Qp Trie Rs
An idiomatic and fast QP-trie implementation in pure Rust.
Stars: ✭ 47 (+74.07%)
Mutual labels:  text-processing
learn perl oneliners
Example based guide for text processing with perl from the command line
Stars: ✭ 63 (+133.33%)
Mutual labels:  text-processing
Concise Ipython Notebooks For Deep Learning
Ipython Notebooks for solving problems like classification, segmentation, generation using latest Deep learning algorithms on different publicly available text and image data-sets.
Stars: ✭ 23 (-14.81%)
Mutual labels:  text-processing
Fake news detection
Fake News Detection in Python
Stars: ✭ 194 (+618.52%)
Mutual labels:  text-mining
Gohn
Hatena Notation (はてな記法) Parser written in Go
Stars: ✭ 17 (-37.04%)
Mutual labels:  text-processing
tap
Text Analytics Pipeline (TAP)
Stars: ✭ 17 (-37.04%)
Mutual labels:  text-analytics
Python Nameparser
A simple Python module for parsing human names into their individual components
Stars: ✭ 462 (+1611.11%)
Mutual labels:  text-processing
Hdltex
HDLTex: Hierarchical Deep Learning for Text Classification
Stars: ✭ 191 (+607.41%)
Mutual labels:  text-mining
Open Korean Text
Open Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (+1522.22%)
Mutual labels:  text-processing
semantria-sdk
Semantria SDK
Stars: ✭ 38 (+40.74%)
Mutual labels:  text-analytics
Aho Corasick
A fast implementation of Aho-Corasick in Rust.
Stars: ✭ 424 (+1470.37%)
Mutual labels:  text-processing
Texthero
Text preprocessing, representation and visualization from zero to hero.
Stars: ✭ 2,407 (+8814.81%)
Mutual labels:  text-mining
Textpipe
Textpipe: clean and extract metadata from text
Stars: ✭ 284 (+951.85%)
Mutual labels:  text-processing
lda2vec
Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019
Stars: ✭ 27 (+0%)
Mutual labels:  text-mining
daachorse
🐎 A fast implementation of the Aho-Corasick algorithm using the compact double-array data structure.
Stars: ✭ 75 (+177.78%)
Mutual labels:  text-processing
Multi rake
Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python
Stars: ✭ 162 (+500%)
Mutual labels:  text-mining
gnu-linux-shell-scripting
A foundation for GNU/Linux shell scripting
Stars: ✭ 23 (-14.81%)
Mutual labels:  text-processing
PubMed-Best-Match
Machine-learning based pipeline relying on LambdaMART currently used in PubMed for relevance (Best Match) searches
Stars: ✭ 36 (+33.33%)
Mutual labels:  text-mining
typ3r.js
🍟 [Library] dA aNn0Y1Ng t3Xt g3NeRa7or
Stars: ✭ 22 (-18.52%)
Mutual labels:  text-processing
Lazynlp
Library to scrape and clean web pages to create massive datasets.
Stars: ✭ 1,985 (+7251.85%)
Mutual labels:  text-mining
stringx
Drop-in replacements for base R string functions powered by stringi
Stars: ✭ 14 (-48.15%)
Mutual labels:  text-processing
textreadr
Tools to uniformly read in text data including semi-structured transcripts
Stars: ✭ 65 (+140.74%)
Mutual labels:  text-mining
Awesome Text Classification
Awesome-Text-Classification Projects,Papers,Tutorial .
Stars: ✭ 158 (+485.19%)
Mutual labels:  text-mining
Quran-and-Arabic-Language-Repository
Projects & Libraries related to Quran & Arabic Language
Stars: ✭ 26 (-3.7%)
Mutual labels:  text-mining
Introduction-to-text-mining-with-Python
Lectures in Urban Data Science Lab, Seoul
Stars: ✭ 25 (-7.41%)
Mutual labels:  text-mining
SwitchNorm Detection
The code of Switchable Normalization for object detection based on Detectron.pytorch.
Stars: ✭ 79 (+192.59%)
Mutual labels:  normalization
Adjutant
Runs a pubmed query, returns results and allows user to explore high-level structure of returned documents
Stars: ✭ 59 (+118.52%)
Mutual labels:  text-mining
reader
Distant Reader, a tool for using & understanding a corpus
Stars: ✭ 18 (-33.33%)
Mutual labels:  text-mining
dmriprep
dMRIPrep is a robust and easy-to-use pipeline for preprocessing of diverse dMRI data. The transparent workflow dispenses of manual intervention, thereby ensuring the reproducibility of the results.
Stars: ✭ 55 (+103.7%)
Mutual labels:  preprocessing
event-embedding-multitask
*SEM 2018: Learning Distributed Event Representations with a Multi-Task Approach
Stars: ✭ 22 (-18.52%)
Mutual labels:  linguistics
sensim
Sentence Similarity Estimator (SenSim)
Stars: ✭ 15 (-44.44%)
Mutual labels:  text-mining
301-360 of 385 similar projects