All Projects → clustext → Similar Projects or Alternatives

457 Open source projects that are alternatives of or similar to clustext

Lazynlp
Library to scrape and clean web pages to create massive datasets.
Stars: ✭ 1,985 (+10927.78%)
Mutual labels:  text-mining
Pyshorttextcategorization
Various Algorithms for Short Text Mining
Stars: ✭ 429 (+2283.33%)
Mutual labels:  text-mining
Learning Social Media Analytics With R
This repository contains code and bonus content which will be added from time to time for the book "Learning Social Media Analytics with R" by Packt
Stars: ✭ 102 (+466.67%)
Mutual labels:  text-mining
Uc Davis Cs Exams Analysis
📈 Regression and Classification with UC Davis student quiz data and exam data
Stars: ✭ 33 (+83.33%)
Mutual labels:  text-mining
text-mining-corona-articles
Text Mining for Indonesian Online News Articles About Corona
Stars: ✭ 15 (-16.67%)
Mutual labels:  text-mining
lda2vec
Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019
Stars: ✭ 27 (+50%)
Mutual labels:  text-mining
Lda Topic Modeling
A PureScript, browser-based implementation of LDA topic modeling.
Stars: ✭ 91 (+405.56%)
Mutual labels:  text-mining
Rplos
R client for the PLoS Journals API
Stars: ✭ 289 (+1505.56%)
Mutual labels:  text-mining
Nlp profiler
A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
Stars: ✭ 181 (+905.56%)
Mutual labels:  text-mining
Khcoder
KH Coder: for Quantitative Content Analysis or Text Mining
Stars: ✭ 126 (+600%)
Mutual labels:  text-mining
Nlppln
NLP pipeline software using common workflow language
Stars: ✭ 31 (+72.22%)
Mutual labels:  text-mining
Adjutant
Runs a pubmed query, returns results and allows user to explore high-level structure of returned documents
Stars: ✭ 59 (+227.78%)
Mutual labels:  text-mining
2018 Machinelearning Lectures Esa
Machine Learning Lectures at the European Space Agency (ESA) in 2018
Stars: ✭ 280 (+1455.56%)
Mutual labels:  text-mining
R Text Data
List of textual data sources to be used for text mining in R
Stars: ✭ 85 (+372.22%)
Mutual labels:  text-mining
Gwu data mining
Materials for GWU DNSC 6279 and DNSC 6290.
Stars: ✭ 217 (+1105.56%)
Mutual labels:  text-mining
tg crawler
Just a crawler based on tg-cli for Telegram. Deprecated by now, please use telegram-export.
Stars: ✭ 71 (+294.44%)
Mutual labels:  text-mining
Python nlp tutorial
This repository provides everything to get started with Python for Text Mining / Natural Language Processing (NLP)
Stars: ✭ 72 (+300%)
Mutual labels:  text-mining
snorkeling
Extracting biomedical relationships from literature with Snorkel 🏊
Stars: ✭ 56 (+211.11%)
Mutual labels:  text-mining
Chemdataextractor
Automatically extract chemical information from scientific documents
Stars: ✭ 152 (+744.44%)
Mutual labels:  text-mining
How To Mine Newsfeed Data And Extract Interactive Insights In Python
A practical guide to topic mining and interactive visualizations
Stars: ✭ 61 (+238.89%)
Mutual labels:  text-mining
TRUNAJOD2.0
An easy-to-use library to extract indices from texts.
Stars: ✭ 18 (+0%)
Mutual labels:  text-mining
thrones2vec
Using Word2Vec to explore semantic similarities between the entities of "A Song of Ice and Fire" ("Game of Thrones").
Stars: ✭ 27 (+50%)
Mutual labels:  text-mining
converse
Conversational text Analysis using various NLP techniques
Stars: ✭ 147 (+716.67%)
Mutual labels:  text-mining
Konlpy
Python package for Korean natural language processing.
Stars: ✭ 1,098 (+6000%)
Mutual labels:  text-mining
vor-knowledge-graph
🎓 Open knowledge mining and graph builder
Stars: ✭ 57 (+216.67%)
Mutual labels:  text-mining
Xioc
Extract indicators of compromise from text, including "escaped" ones.
Stars: ✭ 148 (+722.22%)
Mutual labels:  text-mining
named-entity-recognition
Notebooks for teaching Named Entity Recognition at the Cultural Heritage Data School, run by Cambridge Digital Humanities
Stars: ✭ 18 (+0%)
Mutual labels:  text-mining
Ngram
Fast n-Gram Tokenization
Stars: ✭ 55 (+205.56%)
Mutual labels:  text-mining
protonet-bert-text-classification
finetune bert for small dataset text classification in a few-shot learning manner using ProtoNet
Stars: ✭ 28 (+55.56%)
Mutual labels:  text-classification
advanced-text-mining
TEANAPS 라이브러리를 활용한 자연어 처리와 텍스트 분석 방법론에 대해 다룹니다.
Stars: ✭ 15 (-16.67%)
Mutual labels:  text-mining
Tadw
An implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).
Stars: ✭ 43 (+138.89%)
Mutual labels:  text-mining
ipo-miner
IPO Investment via Text Mining.
Stars: ✭ 20 (+11.11%)
Mutual labels:  text-mining
Hands On Natural Language Processing With Python
This repository is for my students of Udemy. You can find all lecture codes along with mentioned files for reading in here. So, feel free to clone it and if you have any problem just raise a question.
Stars: ✭ 146 (+711.11%)
Mutual labels:  text-mining
SparseLSH
A Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets.
Stars: ✭ 127 (+605.56%)
Mutual labels:  text-mining
Gsoc2018 3gm
💫 Automated codification of Greek Legislation with NLP
Stars: ✭ 36 (+100%)
Mutual labels:  text-mining
Guten-gutter
Strips boilerplate from Project Gutenberg text files
Stars: ✭ 16 (-11.11%)
Mutual labels:  text-mining
VERSE
Vancouver Event and Relation System for Extraction
Stars: ✭ 13 (-27.78%)
Mutual labels:  text-mining
TextDatasetCleaner
🔬 Очистка датасетов от мусора (нормализация, препроцессинг)
Stars: ✭ 27 (+50%)
Mutual labels:  text-mining
Metasra Pipeline
MetaSRA: normalized sample-specific metadata for the Sequence Read Archive
Stars: ✭ 33 (+83.33%)
Mutual labels:  text-mining
Introduction-to-text-mining-with-Python
Lectures in Urban Data Science Lab, Seoul
Stars: ✭ 25 (+38.89%)
Mutual labels:  text-mining
Datasciencer
a curated list of R tutorials for Data Science, NLP and Machine Learning
Stars: ✭ 1,727 (+9494.44%)
Mutual labels:  text-mining
civicmine
Text mining cancer biomarkers for the CIVIC database
Stars: ✭ 19 (+5.56%)
Mutual labels:  text-mining
Tidy Text Mining
Manuscript of the book "Tidy Text Mining with R" by Julia Silge and David Robinson
Stars: ✭ 961 (+5238.89%)
Mutual labels:  text-mining
learning2hash.github.io
Website for "A survey of learning to hash for Computer Vision" https://learning2hash.github.io
Stars: ✭ 14 (-22.22%)
Mutual labels:  text-mining
Qminer
Analytic platform for real-time large-scale streams containing structured and unstructured data.
Stars: ✭ 206 (+1044.44%)
Mutual labels:  text-mining
R.TeMiS
R.TeMiS: R Text Mining Solution
Stars: ✭ 21 (+16.67%)
Mutual labels:  text-mining
Spider
A configurable web spider with a easy-to-use web console
Stars: ✭ 954 (+5200%)
Mutual labels:  text-mining
Awesome Hungarian Nlp
A curated list of NLP resources for Hungarian
Stars: ✭ 121 (+572.22%)
Mutual labels:  text-mining
Keywords2vec
Stars: ✭ 121 (+572.22%)
Mutual labels:  text-mining
Autophrase
AutoPhrase: Automated Phrase Mining from Massive Text Corpora
Stars: ✭ 835 (+4538.89%)
Mutual labels:  text-mining
TabInOut
Framework for information extraction from tables
Stars: ✭ 37 (+105.56%)
Mutual labels:  text-mining
misinfo
📊 Tools to Perform ‘Misinformation’ Analysis on a Text Corpus (wrapper for methods in https://github.com/PDXBek/Misinformation)
Stars: ✭ 17 (-5.56%)
Mutual labels:  text-mining
Bagofconcepts
Python implementation of bag-of-concepts
Stars: ✭ 18 (+0%)
Mutual labels:  text-mining
reader
Distant Reader, a tool for using & understanding a corpus
Stars: ✭ 18 (+0%)
Mutual labels:  text-mining
Multi rake
Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python
Stars: ✭ 162 (+800%)
Mutual labels:  text-mining
Rake Nltk
Python implementation of the Rapid Automatic Keyword Extraction algorithm using NLTK.
Stars: ✭ 793 (+4305.56%)
Mutual labels:  text-mining
neji
Flexible and powerful platform for biomedical information extraction from text
Stars: ✭ 37 (+105.56%)
Mutual labels:  text-mining
Scattertext
Beautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (+9466.67%)
Mutual labels:  text-mining
ConDigSum
Code for EMNLP 2021 paper "Topic-Aware Contrastive Learning for Abstractive Dialogue Summarization"
Stars: ✭ 62 (+244.44%)
Mutual labels:  topic
Aravec
AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models.
Stars: ✭ 239 (+1227.78%)
Mutual labels:  text-mining
61-120 of 457 similar projects