All Projects → pdtyreus → coronavirus-ds

pdtyreus / coronavirus-ds

Licence: MIT license
Jupyter notebooks and python scripts for investigating the 2019 coronavirus outbreak

Programming Languages

Jupyter Notebook
11667 projects
python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to coronavirus-ds

coronavirus-data
This repository contains data on Coronavirus Disease 2019 (COVID-19) in New York City (NYC), from the NYC Department of Health and Mental Hygiene.
Stars: ✭ 926 (+2887.1%)
Mutual labels:  coronavirus
cocoa-documentation
接触確認アプリ COCOA の非公式ドキュメント
Stars: ✭ 51 (+64.52%)
Mutual labels:  coronavirus
Covid-19-API
A realtime API for coronavirus cases on Heroku. Data automatically updated every 10 minutes!
Stars: ✭ 59 (+90.32%)
Mutual labels:  coronavirus
Cough-signal-processing
Different methods and techniques for features extraction from audio
Stars: ✭ 42 (+35.48%)
Mutual labels:  coronavirus
covid19entucomuna
Simple and user-friendly analysis of coronavirus cases by region and by comuna in Chile.
Stars: ✭ 20 (-35.48%)
Mutual labels:  coronavirus
CoronaVirusOutbreakAPI
A tiny and small program to crawler and analyze outbreak of COVID-19 in world and every country using PHP.
Stars: ✭ 20 (-35.48%)
Mutual labels:  coronavirus
covid19-resources
Curated list of Coronavirus data & apps
Stars: ✭ 35 (+12.9%)
Mutual labels:  coronavirus
covid19-br-info
Coronavirus frontend info about Brazil's states and cities
Stars: ✭ 12 (-61.29%)
Mutual labels:  coronavirus
covid19africa
Africa open COVID-19 data working group
Stars: ✭ 47 (+51.61%)
Mutual labels:  coronavirus
serratus
Ultra-deep search for novel viruses
Stars: ✭ 196 (+532.26%)
Mutual labels:  coronavirus
COVID-19-SG
Singapore & Malaysia COVID-19 data from multiple data sources (Zaobao, MOH)
Stars: ✭ 17 (-45.16%)
Mutual labels:  coronavirus
coviddata
Daily COVID-19 statistics by country, region, and city
Stars: ✭ 49 (+58.06%)
Mutual labels:  coronavirus
COVID19MagyarEpi
A magyarországi koronavírus járvány valós idejű, kvantitatív epidemiológiája.
Stars: ✭ 19 (-38.71%)
Mutual labels:  coronavirus
covid19-pr-api
COVID-19 Open API for Datasets in Puerto Rico
Stars: ✭ 21 (-32.26%)
Mutual labels:  coronavirus
covid19
Visualize and compare COVID 19 growth rates of different countries
Stars: ✭ 22 (-29.03%)
Mutual labels:  coronavirus
movingpandas-examples
Example notebooks illustrating MovingPandas use cases
Stars: ✭ 116 (+274.19%)
Mutual labels:  geopandas
ios
CoThings's iOS application. CoThings is a realtime counter for shared things.
Stars: ✭ 13 (-58.06%)
Mutual labels:  coronavirus
COVID-19-Greece
A python-generated website for visualizing the novel coronavirus (COVID-19) data for Greece.
Stars: ✭ 21 (-32.26%)
Mutual labels:  coronavirus
COVID-19-STAT
A web application to keep track of COVID-19 numbers & growth across the world
Stars: ✭ 19 (-38.71%)
Mutual labels:  coronavirus
coronastats
A simple web app which shows updates about COVID-19
Stars: ✭ 18 (-41.94%)
Mutual labels:  coronavirus

Coronavirus Data Science

This repository contains Jupyter notebooks and python scripts for investigating the 2019 coronavirus outbreak. The goal is to serve as a starting point to track and analyze this outbreak. Getting an environment set up to read, analyze, and plot the outbreak data is not trivial. I am hoping this helps more people get started.

If you are a researcher, journalist, or other interested member of the public, please use this freely. If you are a data scientist, please fork and contribute back to build a better foundation for future research.

Goals

  1. Provide a framework and tools for loading outbreak data into Python
  2. Easily visualize outbreak geodata
  3. Facilitate collaboration among researchers

Background

From the CDC:

2019 Novel Coronavirus (2019-nCoV) is a virus (more specifically, a coronavirus) identified as the cause of an outbreak of respiratory illness first detected in Wuhan, China. Early on, many of the patients in the outbreak in Wuhan, China reportedly had some link to a large seafood and animal market, suggesting animal-to-person spread. However, a growing number of patients reportedly have not had exposure to animal markets, indicating person-to-person spread is occurring. At this time, it’s unclear how easily or sustainably this virus is spreading between people. The latest situation summary updates are available on CDC’s web page 2019 Novel Coronavirus, Wuhan, China.

This is an emerging, rapidly evolving situation and CDC will provide updated information as it becomes available.

Data Sources

The data for tracking the 2019-nCoV outbreak is provided by the Johns Hopkins Center for Systems Science and Engineering. They have created an interactive GIS Dashboard.

In response to this ongoing public health emergency, we developed an online dashboard (static snapshot shown below) to visualize and track the reported cases on a daily timescale; the complete set of data is downloadable as a google sheet. The case data visualized is collected from various sources, including WHO, U.S. CDC, ECDC China CDC (CCDC), NHC and DXY. DXY is a Chinese website that aggregates NHC and local CCDC situation reports in near real-time, providing more current regional case estimates than the national level reporting organizations are capable of, and is thus used for all the mainland China cases reported in our dashboard (confirmed, suspected, recovered, deaths). U.S. cases (confirmed, suspected, recovered, deaths) are taken from the U.S. CDC, and all other country (suspected and confirmed) case data is taken from the corresponding regional health departments. The dashboard is intended to provide the public with an understanding of the outbreak situation as it unfolds, with transparent data sources.

Pulling Updates from Google Sheets

The data is updated in a read-only Google Sheet.

Download credentials and install dependencies as described in the Google documentation..

python pull_gsheet_csse.py

Progress

The Jan 25 Jupyter notebook works on a snapshot of data from Jan 25.

  1. Load the coronavirus data into a Pandas DataFrame and plot
  2. Load world, China, and US shapefiles into GeoDataFrames
  3. Merge the coronavirus DataFrame with the GeoDataFrames
  4. Display on a map

Jan 25

The nCoV Spread Jupyter notebook loads all data files into one time-indexed DataFram.

Dependencies

Jupyter Notebooks

pip install pandas
pip install requests
pip install geopandas
pip install descartes

Short-term Roadmap

  1. Load and visualize a data snapshot
  2. Create a script to download new data from Google Sheets
  3. Visualize time-series data

contributions welcome!

Coronavirus Data Science © 2019+, P. Daniel Tyreus, PhD Released under the MIT License.

Twitter @tyreus

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].