All Projects → kethort → Twitterldatopicmodeling

kethort / Twitterldatopicmodeling

Uses topic modeling to identify context between follower relationships of Twitter users

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Twitterldatopicmodeling

Product-Categorization-NLP
Multi-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).
Stars: ✭ 30 (-37.5%)
Mutual labels:  nltk, topic-modeling, gensim
Sarcasm Detection
Detecting Sarcasm on Twitter using both traditonal machine learning and deep learning techniques.
Stars: ✭ 73 (+52.08%)
Mutual labels:  topic-modeling, twitter, tweets
React Static Tweets
Extremely fast static renderer for tweets.
Stars: ✭ 278 (+479.17%)
Mutual labels:  twitter, tweets
Tweetie
Simple jQuery Twitter feed plugin
Stars: ✭ 314 (+554.17%)
Mutual labels:  twitter, tweets
Tweetscraper
TweetScraper is a simple crawler/spider for Twitter Search without using API
Stars: ✭ 694 (+1345.83%)
Mutual labels:  twitter, tweets
archive-explorer-web
Browse your Twitter archive with a friendly, responsive, full experience, and quickly delete the tweets you don't want.
Stars: ✭ 19 (-60.42%)
Mutual labels:  twitter, tweets
resume tailor
An unsupervised analysis combining topic modeling and clustering to preserve an individuals work history and credentials while tailoring their resume towards a new career field
Stars: ✭ 15 (-68.75%)
Mutual labels:  nltk, gensim
Linqtotwitter
LINQ Provider for the Twitter API (C# Twitter Library)
Stars: ✭ 401 (+735.42%)
Mutual labels:  twitter, tweets
TwitterPiBot
A Python based bot for Raspberry Pi that grabs tweets with a specific hashtag and reads them out loud.
Stars: ✭ 85 (+77.08%)
Mutual labels:  twitter, tweets
Twitter sentiment analysis word2vec convnet
Twitter Sentiment Analysis with Gensim Word2Vec and Keras Convolutional Network
Stars: ✭ 24 (-50%)
Mutual labels:  gensim, twitter
Twitter Post Fetcher
Fetch your twitter posts without using the new Twitter 1.1 API. Pure JavaScript! By Jason Mayes
Stars: ✭ 886 (+1745.83%)
Mutual labels:  twitter, tweets
Twweet Cli
🐦 Tweet right from your cli without even opening your browser.
Stars: ✭ 47 (-2.08%)
Mutual labels:  twitter, tweets
twitter-analytics-wrapper
A simple Python wrapper to download tweets data from the Twitter Analytics platform. Particularly interesting for the impressions metrics that are unavailable on current Twitter API. Also works for the videos data.
Stars: ✭ 44 (-8.33%)
Mutual labels:  twitter, tweets
pydataberlin-2017
Repo for my talk at the PyData Berlin 2017 conference
Stars: ✭ 63 (+31.25%)
Mutual labels:  topic-modeling, gensim
Twitter Scraper
Scrape the Twitter Frontend API without authentication.
Stars: ✭ 3,037 (+6227.08%)
Mutual labels:  twitter, tweets
TwEater
A Python Bot for Scraping Conversations from Twitter
Stars: ✭ 16 (-66.67%)
Mutual labels:  twitter, tweets
Yotter
Youtube and Twitter with privacy.
Stars: ✭ 376 (+683.33%)
Mutual labels:  twitter, tweets
Guffer
Guffer tweets based on a daily schedule
Stars: ✭ 12 (-75%)
Mutual labels:  twitter, tweets
awesome-twitter-bots
A Curated Collection of the Best Twitter Bots 🤖
Stars: ✭ 99 (+106.25%)
Mutual labels:  twitter, tweets
Archive-Tweets
Archive and Delete Liked and Posted Tweets
Stars: ✭ 28 (-41.67%)
Mutual labels:  twitter, tweets

Description

Twitter users often associate and socialize with other users based on similar interests. The Tweets of these users can be classified using a trained LDA model to automate the discovery of their similarities.

Prerequisites

Python 2.7 is recommended since the pattern library is currently incompatible with most Python 3 versions.

Python 3.6 can be used with the pattern library, though it may need to be built from source since most newer Linux distributions don't come with it pre-installed. The commands to build Python 3.6 from source are provided in the linux_setup_py3.6.sh script.

Installing

Linux

Download:

git clone https://github.com/kethort/twitter_LDA_topic_modeling.git

Run bash script:

./linux_setup_py3.6.sh

Python pip requirements included in these files:

# for Python 2.7
pip install -r requirements_py2.txt

# for Python 3
pip install -r requirements_py3.txt

Link to the simple-wikipedia dump:

https://dumps.wikimedia.org/simplewiki/latest/simplewiki-latest-pages-articles.xml.bz2

Mac osx

The installation is very similar to the linux installation:

extra install instructions in osx_setup_py3.6.info

pip install -r requirements_py3_OSX.txt

Process

  1. Get user and follower ids by location - twitter_user_grabber.py
  2. Download Tweets for each user - get_community_tweets.py
  3. Create an LDA model from a corpus of documents - create_LDA_model.py
  4. Generate topic probability distributions for Tweet documents - tweets_on_LDA.py
  5. Calculate distances between Tweet documents and graph them - plot_distances.py

Sample Visualizations

Built With

  • Gensim - Package for creating LDA model
  • pyLDAvis - Package for visualizing LDA model
  • Tweepy - Package for interacting with Twitter REST API
  • NLTK - Package for stopword management and tokenization
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].