All Projects → chiphuyen → Metrotwitter

chiphuyen / Metrotwitter

What Twitter reveals about the differences between cities and the monoculture of the Bay Area

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Metrotwitter

Data Science On Gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+1561.54%)
Mutual labels:  jupyter-notebook, data-analysis, data-visualization
Bap
Bayesian Analysis with Python (Second Edition)
Stars: ✭ 379 (+628.85%)
Mutual labels:  jupyter-notebook, data-analysis, data-visualization
Pydataroad
open source for wechat-official-account (ID: PyDataLab)
Stars: ✭ 302 (+480.77%)
Mutual labels:  jupyter-notebook, data-analysis, data-visualization
Deep Learning Machine Learning Stock
Stock for Deep Learning and Machine Learning
Stars: ✭ 240 (+361.54%)
Mutual labels:  jupyter-notebook, data-analysis, data-visualization
Drugs Recommendation Using Reviews
Analyzing the Drugs Descriptions, conditions, reviews and then recommending it using Deep Learning Models, for each Health Condition of a Patient.
Stars: ✭ 35 (-32.69%)
Mutual labels:  jupyter-notebook, data-analysis, data-visualization
Data Science Hacks
Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Stars: ✭ 273 (+425%)
Mutual labels:  jupyter-notebook, data-analysis, data-visualization
Data Science
Collection of useful data science topics along with code and articles
Stars: ✭ 315 (+505.77%)
Mutual labels:  jupyter-notebook, data-analysis, data-visualization
Data Science Resources
👨🏽‍🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (+228.85%)
Mutual labels:  jupyter-notebook, data-analysis, data-visualization
Qs ledger
Quantified Self Personal Data Aggregator and Data Analysis
Stars: ✭ 559 (+975%)
Mutual labels:  jupyter-notebook, data-analysis, data-visualization
Cookbook 2nd Code
Code of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (+940.38%)
Mutual labels:  jupyter-notebook, data-analysis, data-visualization
Amazing Feature Engineering
Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (+319.23%)
Mutual labels:  jupyter-notebook, data-analysis, data-visualization
Data Science Lunch And Learn
Resources for weekly Data Science Lunch & Learns
Stars: ✭ 49 (-5.77%)
Mutual labels:  jupyter-notebook, data-analysis, data-visualization
Edaviz
edaviz - Python library for Exploratory Data Analysis and Visualization in Jupyter Notebook or Jupyter Lab
Stars: ✭ 220 (+323.08%)
Mutual labels:  jupyter-notebook, data-analysis, data-visualization
Cryptocurrency Analysis Python
Open-Source Tutorial For Analyzing and Visualizing Cryptocurrency Data
Stars: ✭ 278 (+434.62%)
Mutual labels:  jupyter-notebook, data-analysis, data-visualization
Dtale
Visualizer for pandas data structures
Stars: ✭ 2,864 (+5407.69%)
Mutual labels:  jupyter-notebook, data-analysis, data-visualization
Articles
A repository for the source code, notebooks, data, files, and other assets used in the data science and machine learning articles on LearnDataSci
Stars: ✭ 350 (+573.08%)
Mutual labels:  jupyter-notebook, data-analysis, data-visualization
Data Science Portfolio
A Portfolio of my Data Science Projects
Stars: ✭ 149 (+186.54%)
Mutual labels:  jupyter-notebook, data-analysis, data-visualization
Ml Workspace
🛠 All-in-one web-based IDE specialized for machine learning and data science.
Stars: ✭ 2,337 (+4394.23%)
Mutual labels:  jupyter-notebook, data-analysis, data-visualization
Courses
Quiz & Assignment of Coursera
Stars: ✭ 454 (+773.08%)
Mutual labels:  jupyter-notebook, data-analysis, data-visualization
Cookbook 2nd
IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (+1253.85%)
Mutual labels:  jupyter-notebook, data-analysis, data-visualization

MetroTwitter

Analysis of Twitter data in 13 English-speaking metropolitan areas (96K users and 180M tweets).

  • US Cities (9): Atlanta, Austin, Bay Area, Boston, Chicago, Washington DC, LA, NYC, Seattle
  • Australian Cities (2): Melbourne, Sydney
  • Canadian Cities (1): Toronto
  • UK Cities (1): London

Plus 223K users that aren't in these areas which are collectively put in 'Other'.

For more information, see the blog post:

MetroTwitter - What Twitter reveals about the differences between cities and the monoculture of the Bay Area

The blog post only contains a small, selected number of visualizations. For more visualization, download here.

I won't be distributing the data for this project to protect users' privacy. If you'd like to discuss the data, contact me through my website huyenchip.com.

How people in different cities describe themselves

In the metrotwitter_visualization folder, you can find word clouds that represent the most popular words in bios in each city. The indi folder visualizes each city independently. The duo folder visualizes the difference between two cities.

What people in different cities talk about

In the metrotwitter_visualization folder, you can find word clouds that represent the most popular words in tweets in each city. The indi folder visualizes each city independently. The duo folder visualizes the difference between two cities.

Ranking cities by popularity of keywords

In the keywords Jupyter notebook in this GitHub repo, there's the method rank_cities_by_keyword to visualize any keyword you want, either using bios or tweets.

You can also plots multiple keywords on the same plot using the method rank_cities_by_multiple_keywords.

Ranking keywords within a city

Also in the keywords Jupyter notebook, there's the method rank_keywords_in_city to rank the popularity of keywords within a city.

Unique popular emojis in each city

Most unique city

Compare two keywords

Just to get a sense of how popular one vs another.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].