The aim of this repository is to show a baseline model for text classification by implementing a LSTM-based model coded in PyTorch. In order to provide a better understanding of the model, it will be used a Tweets dataset provided by Kaggle.

Stars: ✭ 45 (-21.05%)

Mutual labels: text-mining

liquigraph

Migrations for Neo4j

Stars: ✭ 122 (+114.04%)

Mutual labels: graph-database

perke

A keyphrase extractor for Persian

Stars: ✭ 60 (+5.26%)

Mutual labels: text-mining

named-entity-recognition

Notebooks for teaching Named Entity Recognition at the Cultural Heritage Data School, run by Cambridge Digital Humanities

Stars: ✭ 18 (-68.42%)

Mutual labels: text-mining

palladian

Palladian is a Java-based toolkit with functionality for text processing, classification, information extraction, and data retrieval from the Web.

Stars: ✭ 32 (-43.86%)

Mutual labels: text-mining

textlearnR

A simple collection of well working NLP models (Keras, H2O, StarSpace) tuned and benchmarked on a variety of datasets.

Stars: ✭ 16 (-71.93%)

Mutual labels: text-mining

Answerable

Recommendation system for Stack Overflow unanswered questions

Stars: ✭ 13 (-77.19%)

Mutual labels: text-mining

GraphDBLP

a Graph-based instance of DBLP

Stars: ✭ 33 (-42.11%)

Mutual labels: graph-database

database-journal

Databases: Concepts, commands, codes, interview questions and more...

Stars: ✭ 50 (-12.28%)

Mutual labels: graph-database

extractnet

A Dragnet that also extract author, headline, date, keywords from context

Stars: ✭ 52 (-8.77%)

Mutual labels: text-mining

readability

Fast readability scores for text data

Stars: ✭ 22 (-61.4%)

Mutual labels: text-mining

gofastr

Make a DocumentTermMatrix faster

Stars: ✭ 19 (-66.67%)

Mutual labels: text-mining

NoSQLDataEngineering

NoSQL Data Engineering

Stars: ✭ 25 (-56.14%)

Mutual labels: graph-database

SEDTWik-Event-Detection-from-Tweets

Segmentation based event detection from Tweets. Published at NAACL SRW 2019

Stars: ✭ 58 (+1.75%)

Mutual labels: text-mining

text-analysis

Weaving analytical stories from text data

Stars: ✭ 12 (-78.95%)

Mutual labels: text-mining

lda2vec

Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019

Stars: ✭ 27 (-52.63%)

Mutual labels: text-mining

mizo

Super-fast Spark RDD for Titan Graph Database on HBase

Stars: ✭ 24 (-57.89%)

Mutual labels: graph-database

Cypher.js

Cypher graph database for Javascript

Stars: ✭ 30 (-47.37%)

Mutual labels: graph-database

nebula-docker-compose

Docker compose for Nebula Graph

Stars: ✭ 84 (+47.37%)

Mutual labels: graph-database

blueprints-text

Jupyter notebooks for our O'Reilly book "Blueprints for Text Analysis Using Python"

Stars: ✭ 103 (+80.7%)

Mutual labels: text-mining

Gwu data mining

Materials for GWU DNSC 6279 and DNSC 6290.

Stars: ✭ 217 (+280.7%)

Mutual labels: text-mining

RadiologyReportEmbedding

Intelligent Word Embeddings of Free-Text Radiology Reports

Stars: ✭ 22 (-61.4%)

Mutual labels: word2vec-model

Qminer

Analytic platform for real-time large-scale streams containing structured and unstructured data.

Stars: ✭ 206 (+261.4%)

Mutual labels: text-mining

Adjutant

Runs a pubmed query, returns results and allows user to explore high-level structure of returned documents

Stars: ✭ 59 (+3.51%)

Mutual labels: text-mining

Fake news detection

Fake News Detection in Python

Stars: ✭ 194 (+240.35%)

Mutual labels: text-mining

Udacity-Data-Analyst-Nanodegree

Repository for the projects needed to complete the Data Analyst Nanodegree.

Stars: ✭ 31 (-45.61%)

Mutual labels: text-mining

Hdltex

HDLTex: Hierarchical Deep Learning for Text Classification

Stars: ✭ 191 (+235.09%)

Mutual labels: text-mining

SparseLSH

A Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets.

Stars: ✭ 127 (+122.81%)

Mutual labels: text-mining

deduce

Deduce: de-identification method for Dutch medical text

Stars: ✭ 40 (-29.82%)

Mutual labels: text-mining

aera-workshop

This workshop introduces participants to the Learning Analytics (LA), and provides a brief overview of LA methodologies, literature, applications, and ethical issues as they relate to STEM education.

Stars: ✭ 14 (-75.44%)

Mutual labels: text-mining

sensim

Sentence Similarity Estimator (SenSim)

Stars: ✭ 15 (-73.68%)

Mutual labels: text-mining

advanced-text-mining

TEANAPS 라이브러리를 활용한 자연어 처리와 텍스트 분석 방법론에 대해 다룹니다.

Stars: ✭ 15 (-73.68%)

Mutual labels: text-mining

sacred

📖 Sacred texts in R