All Projects → jaeyk → covid19antiasian

jaeyk / covid19antiasian

Licence: other
Investigating how COVID-19 shaped Anti-Asian Climate

Programming Languages

HTML
75241 projects
javascript
184084 projects - #8 most used programming language
Jupyter Notebook
11667 projects
r
7636 projects
shell
77523 projects
python
139335 projects - #7 most used programming language
CSS
56736 projects

Projects that are alternatives of or similar to covid19antiasian

rcvr-app
recover provides localities a privacy-compliant, safe, and easy way for their guests to check in. See https://www.recoverapp.de/ for more details.
Stars: ✭ 43 (+230.77%)
Mutual labels:  covid-19
covid19-philadelphia
De-identified, aggregate datasets showing COVID-19 cases, hospitalizations, deaths and vaccinations by date, zip, or age/sex/race as made available by the City of Philadelphia through its Open Data Program.
Stars: ✭ 15 (+15.38%)
Mutual labels:  covid-19
impfbot
Benachrichtigungs-Bot für das niedersächische Impfportal / Notification bot for the lower saxony vaccination portal https://impfportal-niedersachsen.de
Stars: ✭ 37 (+184.62%)
Mutual labels:  covid-19
PhoNER COVID19
COVID-19 Named Entity Recognition for Vietnamese (NAACL 2021)
Stars: ✭ 55 (+323.08%)
Mutual labels:  covid-19
CovidCountyDash.jl
covid-county-dash.herokuapp.com
Stars: ✭ 26 (+100%)
Mutual labels:  covid-19
canairio firmware
CanAirIO is a citizen science initiative. We are developing a mobile application that is able to set a PM2.5 or CO2 sensors, and other related sensors, as a fixed station using WiFi or mobile station with an smartphone by using a Bluetooth connection.
Stars: ✭ 84 (+546.15%)
Mutual labels:  covid-19
stop-covid19-sfbayarea
Publish COVID-19 data and FAQ local to the Bay Area
Stars: ✭ 26 (+100%)
Mutual labels:  covid-19
vue-covid
VueJS client untuk menampilkan data Covid19
Stars: ✭ 16 (+23.08%)
Mutual labels:  covid-19
covid19-time-series-utilities
several utilities to help wrangle COVID-19 data into a time-series format
Stars: ✭ 34 (+161.54%)
Mutual labels:  covid-19
brazil-civil-registry-data
Raw scrapings of ARPEN https://transparencia.registrocivil.org.br/
Stars: ✭ 35 (+169.23%)
Mutual labels:  covid-19
NYState-COVID-19-Tracker
COVID-19 positive cases tracker for New York State.
Stars: ✭ 14 (+7.69%)
Mutual labels:  covid-19
coronainfobd
Real-time corona-virus tracker of Bangladesh 🇧🇩 which includes latest updates, data visualization, public awareness from WHO and some advice to aware people. 🥰❤
Stars: ✭ 46 (+253.85%)
Mutual labels:  covid-19
Plasma-Donor-App
An open-source app that helps in connecting patients and plasma donors. This is a beginner-friendly repository that helps you learn the basics of android development, git, and GitHub. Happy Hacktober!
Stars: ✭ 58 (+346.15%)
Mutual labels:  covid-19
covid19-timeseries
Covid19 timeseries data store
Stars: ✭ 38 (+192.31%)
Mutual labels:  covid-19
COVID-19-tracker
北航大数据高精尖中心研究团队进行数据来源的整理与获取,利用自然语言处理等技术从已公开全国4626确诊患者轨迹中抽取了基本信息(性别、年龄、常住地、工作、武汉/湖北接触史等)、轨迹(时间、地点、交通工具、事件)及病患关系形成结构化信息
Stars: ✭ 75 (+476.92%)
Mutual labels:  covid-19
covid-19
A web application to display Coronavirus Diseases (COVID19) statistics from different countries.
Stars: ✭ 28 (+115.38%)
Mutual labels:  covid-19
covidAPI
Coronavirus API for Current cases by country COVID-19
Stars: ✭ 600 (+4515.38%)
Mutual labels:  covid-19
path-19
COVID-19 contact tracing mobile app
Stars: ✭ 12 (-7.69%)
Mutual labels:  covid-19
covid-19
An app made with Flutter to track COVID-19 case counts.
Stars: ✭ 47 (+261.54%)
Mutual labels:  covid-19
CovidVaccineNotifier
Get notified with available vaccination centres via SMS
Stars: ✭ 21 (+61.54%)
Mutual labels:  covid-19

Replication data and code for Study 1 (Social Media Data Analysis)

Author: Jae Yeon Kim ([email protected])

Paper: https://osf.io/preprints/socarxiv/dvm7r/ (accepted at Perspectives on Politics)

Session information

  1. Programming languages
  • R version 4.0.4 (2021-02-15)
  • Python 3.8.8
  • Bash 5.1.4(1)-release
  1. Operation system
  • Platform: x86_64-pc-linux-gnu (64-bit)
  • Running under: Ubuntu 21.04

Data collection

Raw data: tweet_ids

The data source is the large-scale COVID-19 Twitter chatter dataset (v.15) created by Panacealab. The original dataset only provided tweet IDs, not tweets, following Twitter's developer terms. I turned these tweet IDs back into a JSON file (tweets) using Twarc. This process is called hydrating and is very time-consuming. To ease the process, I created an R package, called tidytweetjson, that efficiently parses this large JSON file into a tidyverse-ready data frame. To help replication, I also saved the IDs of the tweets by typing the following command in the terminal: grep "INFO archived" twarc.log | awk '{print $5}' > tweet_ids

Replication code

  • 00_setup.sh: Shell script for collecting Tweets and their related metadata based on Tweet IDs

  • 01_google_trends.r: R script for collecting Google search API data

  • 01_sample.Rmd: R markdown file for sampling Twitter data

  • 02_parse.r: R script for parsing Twitter data. This script produced a cleaned and wrangled data named 'parsed.rds.' This file is not included in this repository to not violate Twitter's Developer Terms. Also, its file size is quite large (1.4 GB).

Descriptive analysis

Replication code

  • 03_explore.Rmd: R markdown file for further wrangling and exploring data. This file creates Figure 2. (overall_trend.png)

  • 04_01_hashtags.R: R script file for creating a wordlcoud of hashtags. This file creates Figure 1. (hash_cloud.png)

  • 04_clean.ipynb: Python notebook for cleaning texts

Topic modeling

Replication code

  • 05_topic_modeling.Rmd: R markdown for topic modeling analysis. This file creates Figure 3 (dynamic_topic_day.png)
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].