All Projects → Guten-gutter → Similar Projects or Alternatives

161 Open source projects that are alternatives of or similar to Guten-gutter

woolly
The Text Mining Elixir
Stars: ✭ 48 (+200%)
Mutual labels:  text-mining
text-mined-synthesis public
Codes for text-mined solid-state reactions dataset
Stars: ✭ 46 (+187.5%)
Mutual labels:  text-mining
textlearnR
A simple collection of well working NLP models (Keras, H2O, StarSpace) tuned and benchmarked on a variety of datasets.
Stars: ✭ 16 (+0%)
Mutual labels:  text-mining
teanaps
자연어 처리와 텍스트 분석을 위한 오픈소스 파이썬 라이브러리 입니다.
Stars: ✭ 91 (+468.75%)
Mutual labels:  text-mining
clustext
Easy, fast clustering of texts
Stars: ✭ 18 (+12.5%)
Mutual labels:  text-mining
neji
Flexible and powerful platform for biomedical information extraction from text
Stars: ✭ 37 (+131.25%)
Mutual labels:  text-mining
intertext
Detect and visualize text reuse
Stars: ✭ 97 (+506.25%)
Mutual labels:  text-mining
TRUNAJOD2.0
An easy-to-use library to extract indices from texts.
Stars: ✭ 18 (+12.5%)
Mutual labels:  text-mining
readability
Fast readability scores for text data
Stars: ✭ 22 (+37.5%)
Mutual labels:  text-mining
SEDTWik-Event-Detection-from-Tweets
Segmentation based event detection from Tweets. Published at NAACL SRW 2019
Stars: ✭ 58 (+262.5%)
Mutual labels:  text-mining
malay-dataset
Text corpus for Bahasa Malaysia, https://malaya.readthedocs.io/en/latest/Dataset.html
Stars: ✭ 189 (+1081.25%)
Mutual labels:  text-mining
Validator.js
String validation
Stars: ✭ 18,842 (+117662.5%)
Mutual labels:  sanitization
rulr
📐 Validation and unit conversion errors in TypeScript at compile-time. Started in 2016.
Stars: ✭ 43 (+168.75%)
Mutual labels:  sanitization
TableDisentangler
Functional and structural analysis of tables in research papers (Table disentangling)
Stars: ✭ 21 (+31.25%)
Mutual labels:  text-mining
Adjutant
Runs a pubmed query, returns results and allows user to explore high-level structure of returned documents
Stars: ✭ 59 (+268.75%)
Mutual labels:  text-mining
corpusexplorer2.0
Korpuslinguistik war noch nie so einfach...
Stars: ✭ 16 (+0%)
Mutual labels:  text-mining
tf-idf-python
Term frequency–inverse document frequency for Chinese novel/documents implemented in python.
Stars: ✭ 98 (+512.5%)
Mutual labels:  text-mining
crminer
⛔ ARCHIVED ⛔ Fetch 'Scholary' Full Text from 'Crossref'
Stars: ✭ 17 (+6.25%)
Mutual labels:  text-mining
text-mining-corona-articles
Text Mining for Indonesian Online News Articles About Corona
Stars: ✭ 15 (-6.25%)
Mutual labels:  text-mining
palladian
Palladian is a Java-based toolkit with functionality for text processing, classification, information extraction, and data retrieval from the Web.
Stars: ✭ 32 (+100%)
Mutual labels:  text-mining
extractnet
A Dragnet that also extract author, headline, date, keywords from context
Stars: ✭ 52 (+225%)
Mutual labels:  text-mining
text-analysis
Weaving analytical stories from text data
Stars: ✭ 12 (-25%)
Mutual labels:  text-mining
converse
Conversational text Analysis using various NLP techniques
Stars: ✭ 147 (+818.75%)
Mutual labels:  text-mining
Bluemonday
bluemonday: a fast golang HTML sanitizer (inspired by the OWASP Java HTML Sanitizer) to scrub user generated content of XSS
Stars: ✭ 2,135 (+13243.75%)
Mutual labels:  sanitization
Twitter-Sentiment-Analyzer
Twitter Sentiment Analyzer
Stars: ✭ 13 (-18.75%)
Mutual labels:  text-mining
Search
Blue Brain text mining toolbox for semantic search and structured information extraction
Stars: ✭ 26 (+62.5%)
Mutual labels:  text-mining
filter
⏳ Provide filtering, sanitizing, and conversion of Golang data. 提供对Golang数据的过滤,净化,转换。
Stars: ✭ 53 (+231.25%)
Mutual labels:  sanitization
reader
Distant Reader, a tool for using & understanding a corpus
Stars: ✭ 18 (+12.5%)
Mutual labels:  text-mining
PubMed-Best-Match
Machine-learning based pipeline relying on LambdaMART currently used in PubMed for relevance (Best Match) searches
Stars: ✭ 36 (+125%)
Mutual labels:  text-mining
learning2hash.github.io
Website for "A survey of learning to hash for Computer Vision" https://learning2hash.github.io
Stars: ✭ 14 (-12.5%)
Mutual labels:  text-mining
iis
Information Inference Service of the OpenAIRE system
Stars: ✭ 16 (+0%)
Mutual labels:  text-mining
TabInOut
Framework for information extraction from tables
Stars: ✭ 37 (+131.25%)
Mutual labels:  text-mining
estratto
parsing fixed width files content made easy
Stars: ✭ 12 (-25%)
Mutual labels:  text-mining
Introduction-to-text-mining-with-Python
Lectures in Urban Data Science Lab, Seoul
Stars: ✭ 25 (+56.25%)
Mutual labels:  text-mining
sentometrics
An integrated framework in R for textual sentiment time series aggregation and prediction
Stars: ✭ 77 (+381.25%)
Mutual labels:  text-mining
rosette-elasticsearch-plugin
Document Enrichment plugin for Elasticsearch
Stars: ✭ 25 (+56.25%)
Mutual labels:  text-mining
trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Stars: ✭ 711 (+4343.75%)
Mutual labels:  text-mining
R.TeMiS
R.TeMiS: R Text Mining Solution
Stars: ✭ 21 (+31.25%)
Mutual labels:  text-mining
BioMedical-NLP-corpus
Biomedical NLP Corpus or Datasets.
Stars: ✭ 44 (+175%)
Mutual labels:  text-mining
html-sanitizer
HTML sanitizer, written in PHP, aiming to provide XSS-safe markup based on explicitly allowed tags, attributes and values.
Stars: ✭ 18 (+12.5%)
Mutual labels:  sanitization
Text-Classification-LSTMs-PyTorch
The aim of this repository is to show a baseline model for text classification by implementing a LSTM-based model coded in PyTorch. In order to provide a better understanding of the model, it will be used a Tweets dataset provided by Kaggle.
Stars: ✭ 45 (+181.25%)
Mutual labels:  text-mining
TextDatasetCleaner
🔬 Очистка датасетов от мусора (нормализация, препроцессинг)
Stars: ✭ 27 (+68.75%)
Mutual labels:  text-mining
perke
A keyphrase extractor for Persian
Stars: ✭ 60 (+275%)
Mutual labels:  text-mining
textreadr
Tools to uniformly read in text data including semi-structured transcripts
Stars: ✭ 65 (+306.25%)
Mutual labels:  text-mining
Answerable
Recommendation system for Stack Overflow unanswered questions
Stars: ✭ 13 (-18.75%)
Mutual labels:  text-mining
thrones2vec
Using Word2Vec to explore semantic similarities between the entities of "A Song of Ice and Fire" ("Game of Thrones").
Stars: ✭ 27 (+68.75%)
Mutual labels:  text-mining
koshort
(deprecated) 🐱 koshort is a Python package for Korean internet spoken language crawling and processing... or maybe Korean domestic cat.
Stars: ✭ 62 (+287.5%)
Mutual labels:  text-mining
JoSH
[KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding
Stars: ✭ 55 (+243.75%)
Mutual labels:  text-mining
pathvalidate
A Python library to sanitize/validate a string such as filenames/file-paths/etc.
Stars: ✭ 139 (+768.75%)
Mutual labels:  sanitization
civicmine
Text mining cancer biomarkers for the CIVIC database
Stars: ✭ 19 (+18.75%)
Mutual labels:  text-mining
Sanitize
Ruby HTML and CSS sanitizer.
Stars: ✭ 1,940 (+12025%)
Mutual labels:  sanitization
odinson
Odinson is a powerful and highly optimized open-source framework for rule-based information extraction. Odinson couples a simple, yet powerful pattern language that can operate over multiple representations of text, with a runtime system that operates in near real time.
Stars: ✭ 59 (+268.75%)
Mutual labels:  text-mining
Govalidator
[Go] Package of validators and sanitizers for strings, numerics, slices and structs
Stars: ✭ 5,163 (+32168.75%)
Mutual labels:  sanitization
misinfo
📊 Tools to Perform ‘Misinformation’ Analysis on a Text Corpus (wrapper for methods in https://github.com/PDXBek/Misinformation)
Stars: ✭ 17 (+6.25%)
Mutual labels:  text-mining
Udacity-Data-Analyst-Nanodegree
Repository for the projects needed to complete the Data Analyst Nanodegree.
Stars: ✭ 31 (+93.75%)
Mutual labels:  text-mining
restaurant-finder-featureReviews
Build a Flask web application to help users retrieve key restaurant information and feature-based reviews (generated by applying market-basket model – Apriori algorithm and NLP on user reviews).
Stars: ✭ 21 (+31.25%)
Mutual labels:  text-mining
Quran-and-Arabic-Language-Repository
Projects & Libraries related to Quran & Arabic Language
Stars: ✭ 26 (+62.5%)
Mutual labels:  text-mining
lda2vec
Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019
Stars: ✭ 27 (+68.75%)
Mutual labels:  text-mining
VERSE
Vancouver Event and Relation System for Extraction
Stars: ✭ 13 (-18.75%)
Mutual labels:  text-mining
deduce
Deduce: de-identification method for Dutch medical text
Stars: ✭ 40 (+150%)
Mutual labels:  text-mining
1-60 of 161 similar projects