All Projects â†’ tdhopper â†’ Pydata Nyc 2015

tdhopper / Pydata Nyc 2015

📜 Understanding Probabilistic Topic Models with Simulation in Python

Projects that are alternatives of or similar to Pydata Nyc 2015

Pico
Object Detection and Analysis Made easy using Raspberry Pi, Apache Kafka, AWS Rekognition & Docker
Stars: ✭ 63 (-1.56%)
Mutual labels:  jupyter-notebook
Otml ds3 2018
Practical sessions for the Optimal Transport and Machine learning course at DS3 2018
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Genetic Algorithm Rnn
Using Genetic Algorithms to optimize Recurrent Neural Network's Configuration
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Vehicle Trajectory Prediction On Ngsim
Stars: ✭ 63 (-1.56%)
Mutual labels:  jupyter-notebook
Indonesian Language Models
Indonesian Language Models and its Usage
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Rmsync
A script for synchronizing the reMarkable e-reader
Stars: ✭ 63 (-1.56%)
Mutual labels:  jupyter-notebook
Processamento Digital De Sinais Financeiros
Estabelecer competências em técnicas quantitativas aplicadas ao mercado de renda variável, por meio da aplicação dos métodos de processamento digital de séries temporais.
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Apmae4990
Introduction to Data Science in Industry
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Machine Learning Notes
A repository to save my machine learning notes.
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Deepbayes2017
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Notebooks
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Deconfounder tutorial
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Matplotlib4papers
Matplotlib examples to present results.
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Bootcamp2019
Repository of syllabi, lecture notes, Jupyter notebooks, code, and problem sets for OSE Lab Boot Camp 2019
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Text Top Model
Benchmarking text classification algorithms
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Efficientnet Gradcam Visualization
EfficientNet-GradCam Visualization
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Net Analysis
Tools, libraries and applications to analyze network measurements and detect interference.
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Taxiprediction
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Dqn Pytorch
Deep Q Learning via Pytorch
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook
Machine Learning In Finance
Stars: ✭ 64 (+0%)
Mutual labels:  jupyter-notebook

By Tim Hopper: tdhopper.com

alt text alt text

Understanding Probabilistic Topic Models By Simulation

Overview

Description

Latent Dirichlet Allocation and related topic models are often presented in the form of complicated equations and confusing diagrams. I will present LDA as a generative model through probabilistic simulation in simple Python. Simulation will help data scientists to understand the model assumptions and limitations and more effectively use black box LDA implementations.

Abstract

Those without training in probabilistic graphical models and measure theory, data scientist may have a hard time understanding Latent Dirichlet Allocation and other probabilistic topic models. However, because LDA is a generative model, we can write Python code to generated data based on the model assumptions.

The talk will progress as follows:

  • Introduction to mixture models
  • Simulation of mixture models
  • Introduction to grouped data
  • Simulation of latent Dirichlet allocation
  • Fitting and visualizing LDA with Python

Setup Conda Environment and Launch Notebook

With Conda installed, run

$ git clone https://github.com/tdhopper/pydata-nyc-2015.git understanding-lda
$ cd understanding-lda
$ make install
$ source activate understanding-lda

To view the notebook, run

$ make

To view the notebook as a slideshow, run

make slides
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].