Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → cpsievert → Ldavis

cpsievert / Ldavis

Licence: other

R package for web-based interactive topic model visualization.

Programming Languages

184084 projects - #8 most used programming language

7636 projects

Labels

visualization text-mining topic-modeling

Projects that are alternatives of or similar to Ldavis

Learning Social Media Analytics With R

This repository contains code and bonus content which will be added from time to time for the book "Learning Social Media Analytics with R" by Packt

Stars: ✭ 102 (-78.11%)

Mutual labels: text-mining, topic-modeling

자연어 처리와 텍스트 분석을 위한 오픈소스 파이썬 라이브러리 입니다.

Stars: ✭ 91 (-80.47%)

Mutual labels: text-mining, topic-modeling

Beautiful visualizations of how language differs among document types.

Stars: ✭ 1,722 (+269.53%)

Mutual labels: text-mining, topic-modeling

Fast vectorization, topic modeling, distances and GloVe word embeddings in R.

Stars: ✭ 715 (+53.43%)

Mutual labels: text-mining, topic-modeling

2018 Machinelearning Lectures Esa

Machine Learning Lectures at the European Space Agency (ESA) in 2018

Stars: ✭ 280 (-39.91%)

Mutual labels: text-mining, topic-modeling

Lda Topic Modeling

A PureScript, browser-based implementation of LDA topic modeling.

Stars: ✭ 91 (-80.47%)

Mutual labels: text-mining, topic-modeling

Weaving analytical stories from text data

Stars: ✭ 12 (-97.42%)

Mutual labels: text-mining, topic-modeling

How To Mine Newsfeed Data And Extract Interactive Insights In Python

A practical guide to topic mining and interactive visualizations

Stars: ✭ 61 (-86.91%)

Mutual labels: text-mining, topic-modeling

Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019

Stars: ✭ 27 (-94.21%)

Mutual labels: text-mining, topic-modeling

Conversational text Analysis using various NLP techniques

Stars: ✭ 147 (-68.45%)

Mutual labels: text-mining, topic-modeling

Fast topic modeling platform

Stars: ✭ 563 (+20.82%)

Mutual labels: text-mining, topic-modeling

Pyshorttextcategorization

Various Algorithms for Short Text Mining

Stars: ✭ 429 (-7.94%)

Mutual labels: text-mining, topic-modeling

Code & data accompanying the KDD 2017 paper "KATE: K-Competitive Autoencoder for Text"

Stars: ✭ 135 (-71.03%)

Mutual labels: text-mining, topic-modeling

[KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding

Stars: ✭ 55 (-88.2%)

Mutual labels: text-mining, topic-modeling

BERT, LDA, and TFIDF based keyword extraction in Python

Stars: ✭ 33 (-92.92%)

Mutual labels: text-mining, topic-modeling

Text mining resources

Resources for learning about Text Mining and Natural Language Processing

Stars: ✭ 358 (-23.18%)

Mutual labels: text-mining, topic-modeling

This repository contains the code related to Natural Language Processing using python scripting language. All the codes are related to my book entitled "Python Natural Language Processing"

Stars: ✭ 265 (-43.13%)

Mutual labels: text-mining

Artificial Adversary

🗣️ Tool to generate adversarial text examples and test machine learning models against them

Stars: ✭ 348 (-25.32%)

Mutual labels: text-mining

LDA topic modeling for node.js

Stars: ✭ 262 (-43.78%)

Mutual labels: topic-modeling

latent-semantic-analysis

Pipeline for training LSA models using Scikit-Learn.

Stars: ✭ 20 (-95.71%)

Mutual labels: topic-modeling

View All Similar Projects ➔

LDAvis

R package for interactive topic model visualization.

LDAvis is designed to help users interpret the topics in a topic model that has been fit to a corpus of text data. The package extracts information from a fitted LDA topic model to inform an interactive web-based visualization.

Installing the package

Stable version on CRAN:

install.packages("LDAvis")

Development version on GitHub (with devtools):

devtools::install_github("cpsievert/LDAvis")

Getting started

Once installed, we recommend a visit to the main help page:

library(LDAvis)
help(createJSON, package = "LDAvis")

The documentation and example on the bottom of that page should provide a quick sense of how to create (and share) your own visualizations. If you want more details about the technical specifications of the visualization, see the vignette:

vignette("details", package = "LDAvis")

Note that LDAvis itself does not provide facilities for fitting the model (only visualizing a fitted model). If you want to perform LDA in R, there are several packages, including mallet, lda, and topicmodels.

If you want to perform LDA with the R package lda and visualize the result with LDAvis, our example of a 20-topic model fit to 2,000 movie reviews may be helpful.

LDAvis does not limit you to topic modeling facilities in R. If you use other tools (MALLET and gensim are popular), we recommend that you visit our Twenty Newsgroups example to help quickly understand what components LDAvis will need.

Sharing a Visualization

To share a visualization that you created using LDAvis, you can encode the state of the visualization into the URL by appending a string of the form:

"#topic=k&lambda=l&term=s"

to the end of the URL, where "k", "l", and "s" are strings indicating the desired values of the selected topic, the value of lambda, and the selected term, respectively. For more details, see the last section of our Movie Reviews example, or for a quick example, see the link here:

https://ldavis.cpsievert.me/reviews/vis/#topic=3&lambda=0.6&term=cop

Video demos

More documentation

To read about the methodology behind LDAvis, see our paper, which we presented at the 2014 ACL Workshop on Interactive Language Learning, Visualization, and Interfaces in Baltimore on June 27, 2014.

Additional data

We included one data set in LDAvis, 'TwentyNewsgroups', which consists of a list with 5 elements:

phi, a matrix with the topic-term distributions
theta, a matrix with the document-topic distributions
doc.length, a numeric vector with token counts for each document
vocab, a character vector containing the terms
term.frequency, a numeric vector of observed term frequencies

We also created a second data-only package called LDAvisData to hold additional example data sets. Currently there are three more examples available there:

Movie Reviews (a 20-topic model fit to 2,000 movie reviews)
AP (a 40-topic model fit to approximately 2,246 news articles)
Jeopardy (a 100-topic model fit to approximately 20,000 Jeopardy questions)

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 466

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (31) 🔗