Open source research tool to search, browse, analyze and explore large document collections, combining a semantic search engine with an open source text mining and text analytics platform. It integrates ETL for document processing, OCR for images and PDFs, named entity recognition for persons, organizations and locations, metadata management by thesaurus and ontologies, and a search user interface and search apps for full-text search, faceted search and the knowledge graph.

Stars: ✭ 386 (+324.18%)

Mutual labels: text-mining, named-entity-recognition

Data mining

The Ruby DataMining Gem is a small collection of data mining algorithms.

Stars: ✭ 10 (-89.01%)

Mutual labels: data-mining, clustering

Ldavis

R package for web-based interactive topic model visualization.

Stars: ✭ 466 (+412.09%)

Mutual labels: text-mining, topic-modeling

Pyshorttextcategorization

Various Algorithms for Short Text Mining

Stars: ✭ 429 (+371.43%)

Mutual labels: text-mining, topic-modeling

Bigartm

Fast topic modeling platform

Stars: ✭ 563 (+518.68%)

Mutual labels: text-mining, topic-modeling

deduce

Deduce: de-identification method for Dutch medical text

Stars: ✭ 40 (-56.04%)

Mutual labels: text-mining, text-processing

Orange3

🍊 📊 💡 Orange: Interactive data analysis

Stars: ✭ 3,152 (+3363.74%)

Mutual labels: data-mining, clustering

hierarchical-clustering

A Python implementation of divisive and hierarchical clustering algorithms. The algorithms were tested on the Human Gene DNA Sequence dataset and dendrograms were plotted.

Stars: ✭ 62 (-31.87%)

Mutual labels: data-mining, clustering

Gensim

Topic Modelling for Humans

Stars: ✭ 12,763 (+13925.27%)

Mutual labels: data-mining, topic-modeling

TRUNAJOD2.0

An easy-to-use library to extract indices from texts.

Stars: ✭ 18 (-80.22%)

Mutual labels: text-mining, text-processing

converse

Conversational text analysis using various NLP techniques

Stars: ✭ 147 (+61.54%)

Mutual labels: text-mining, topic-modeling

estratto

Parsing the content of fixed-width files made easy

Stars: ✭ 12 (-86.81%)

Mutual labels: text-mining, text-processing

Matrixprofile

A Python 3 library making time series data mining tasks, utilizing matrix profile algorithms, accessible to everyone.

Stars: ✭ 141 (+54.95%)

Mutual labels: data-mining, clustering

kwx

BERT-, LDA-, and TF-IDF-based keyword extraction in Python

Stars: ✭ 33 (-63.74%)

Mutual labels: text-mining, topic-modeling

support-tickets-classification

This case study shows how to create a model for text analysis and classification and deploy it as a web service in Azure cloud in order to automatically classify support tickets. This project is a proof of concept made by Microsoft (Commercial Software Engineering team) in collaboration with Endava http://endava.com/en

Stars: ✭ 142 (+56.04%)

Mutual labels: text-mining, text-processing

Pipeit

PipeIt is a text transformation, conversion, cleansing and extraction tool.

Stars: ✭ 57 (-37.36%)

Mutual labels: text-mining, text-processing

How To Mine Newsfeed Data And Extract Interactive Insights In Python

A practical guide to topic mining and interactive visualizations

Stars: ✭ 61 (-32.97%)

Mutual labels: text-mining, topic-modeling

Rmdl

RMDL: Random Multimodel Deep Learning for Classification

Stars: ✭ 375 (+312.09%)

Mutual labels: text-mining, data-mining

Textcluster

Short text clustering preprocessing module

Stars: ✭ 115 (+26.37%)

Mutual labels: text-mining, text-processing

Awesome Hungarian Nlp

A curated list of NLP resources for Hungarian

Stars: ✭ 121 (+32.97%)

Mutual labels: text-mining, named-entity-recognition

Scattertext

Beautiful visualizations of how language differs among document types.

Stars: ✭ 1,722 (+1792.31%)

Mutual labels: text-mining, topic-modeling

Kate

Code & data accompanying the KDD 2017 paper "KATE: K-Competitive Autoencoder for Text"

Stars: ✭ 135 (+48.35%)

Mutual labels: text-mining, topic-modeling

Tadw

An implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).

Stars: ✭ 43 (-52.75%)

Mutual labels: text-mining, data-mining

Metasra Pipeline

MetaSRA: normalized sample-specific metadata for the Sequence Read Archive

Stars: ✭ 33 (-63.74%)

Mutual labels: text-mining, data-mining

Applied Text Mining In Python

Repo for Applied Text Mining in Python (Coursera) by the University of Michigan

Stars: ✭ 59 (-35.16%)

Mutual labels: text-mining, text-processing

Text Mining

Text Mining in Python

Stars: ✭ 18 (-80.22%)

Mutual labels: text-mining, text-processing

Learning Social Media Analytics With R

This repository contains code and bonus content which will be added from time to time for the book "Learning Social Media Analytics with R" by Packt

Stars: ✭ 102 (+12.09%)

Mutual labels: text-mining, topic-modeling

Bagofconcepts

Python implementation of bag-of-concepts

Stars: ✭ 18 (-80.22%)

Mutual labels: text-mining, clustering

Gwu data mining

Materials for GWU DNSC 6279 and DNSC 6290.

Stars: ✭ 217 (+138.46%)

Mutual labels: text-mining, data-mining

Qminer

Analytic platform for real-time large-scale streams containing structured and unstructured data.

Stars: ✭ 206 (+126.37%)

Mutual labels: text-mining, data-mining

hangul-search-js

🇰🇷 Simple Korean text search module

Stars: ✭ 22 (-75.82%)

Mutual labels: korean-text-processing, korean-nlp

Text-Classification-LSTMs-PyTorch

This repository shows a baseline model for text classification, implemented as an LSTM-based model in PyTorch. To make the model easier to understand, it is applied to a Tweets dataset provided by Kaggle.

Stars: ✭ 45 (-50.55%)

Mutual labels: text-mining, text-processing

Pyss3

A Python package implementing a new machine learning model for text classification with visualization tools for Explainable AI

Stars: ✭ 191 (+109.89%)

Mutual labels: text-mining, data-mining

Elki

ELKI Data Mining Toolkit

Stars: ✭ 613 (+573.63%)

Mutual labels: data-mining, clustering

Pyclustering

pyclustering is a Python, C++ data mining library.