
simonw / covid-19-datasette

Licence: other
Deploys a Datasette instance of COVID-19 data from Johns Hopkins CSSE and the New York Times

Programming languages: Python, shell


covid-19-datasette

Fetch latest data and deploy with Datasette

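Deploys a Datasette instance with data from the following sources:

  • Johns Hopkins CSSE
  • The New York Times
  • The Los Angeles Times
  • The Economist
  • US Census (population data)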

The Datasette instance lives at https://covid-19.datasettes.com/ and is updated hourly using a scheduled GitHub Action.
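Datasette can run a SQL query and return the result as JSON. The following is a minimal sketch of building such a URL for this instance. The database name `covid` and the helper `build_query_url` are assumptions made for illustration, not something this README confirms; check the live site for the real database name.

```python
from urllib.parse import urlencode

# Base URL comes from this README; the database name "covid" is an assumption.
BASE_URL = "https://covid-19.datasettes.com"


def build_query_url(sql: str, database: str = "covid", base_url: str = BASE_URL) -> str:
    """Return a Datasette JSON URL that runs `sql` and returns a list of row objects."""
    # _shape=array asks Datasette for a plain JSON array of objects.
    params = urlencode({"sql": sql, "_shape": "array"})
    return f"{base_url}/{database}.json?{params}"


if __name__ == "__main__":
    # ny_times_us_states is a table named in this README.
    print(build_query_url("select * from ny_times_us_states limit 5"))
```

The sketch only builds the URL; fetching it (for example with `urllib.request`) is left out so the example runs without network access.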

More about this project on my blog: COVID-19 numbers in Datasette.

This repository uses the deployment pattern described in Deploying a data API using GitHub Actions and Cloud Run.

Using this data responsibly

Please do not use this tool to share information about COVID-19 without making absolutely sure you understand how the data is structured and sourced.

Recommended reading:

Johns Hopkins

The database is partly built from the daily report CSV files in the Johns Hopkins CSSE csse_covid_19_data folder - be sure to consult their README for documentation of the fields.

They are actively changing how they report data. Follow their issues closely for updates: for example, this issue about switching from reporting USA data at the county level to the state level.

The build script for the database makes one alteration to their data: it attempts to fill any missing latitude and longitude values with values from similar rows.

If you are going to make use of those columns, make sure you understand how that backfill mechanism works in case it affects your calculations in some way.
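To give a rough idea of what a backfill like this can look like, here is a minimal sketch. It is not the repository's build script: it assumes "similar rows" means rows sharing the same country_or_region and province_or_state, and the function name `backfill_coordinates` is hypothetical. Read the actual build script before relying on the coordinates.

```python
from typing import Any, Dict, List, Tuple

Row = Dict[str, Any]


def place_key(row: Row) -> Tuple[Any, Any]:
    # Assumption: two rows are "similar" when they describe the same place.
    return (row.get("country_or_region"), row.get("province_or_state"))


def backfill_coordinates(rows: List[Row]) -> List[Row]:
    """Return copies of rows with missing coordinates filled from similar rows."""
    known: Dict[Tuple[Any, Any], Tuple[Any, Any]] = {}
    # First pass: remember the first known coordinates for each place.
    for row in rows:
        if row.get("latitude") is not None and row.get("longitude") is not None:
            known.setdefault(place_key(row), (row["latitude"], row["longitude"]))
    # Second pass: fill rows that lack coordinates; leave the rest untouched.
    filled = []
    for row in rows:
        new_row = dict(row)
        missing = new_row.get("latitude") is None or new_row.get("longitude") is None
        if missing and place_key(row) in known:
            new_row["latitude"], new_row["longitude"] = known[place_key(row)]
        filled.append(new_row)
    return filled
```

A consequence worth noting: a backfilled row carries coordinates that were never reported for that day, so a row with coordinates is not proof that the source file contained them.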

The New York Times

The New York Times has a comprehensive README describing how their data is sourced. You should read it! They announced their data in We’re Sharing Coronavirus Case Data for Every U.S. County.

They are using the data for their Coronavirus in the U.S.: Latest Map and Case Count article.

The Los Angeles Times

The Los Angeles Times' comprehensive README describes the data in the latimes_agency_totals, latimes_county_totals, latimes_place_totals and latimes_state_totals tables.

See To aid coronavirus fight, The Times releases database of California cases for background on the release of this data.

The data is used for their Tracking coronavirus in California page, which is constantly updated.

The Economist

The Economist publish the data behind their ongoing interactive Tracking covid-19 excess deaths across countries. Their README describes the data sources they use for individual countries in detail.

This data is imported into the economist_excess_deaths and economist_historical_deaths tables, with one alteration: a cadence column is added showing whether each row is collected on a weekly or monthly basis.
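One way such a cadence column could be derived is sketched below. This is a guess at the approach, not the repository's import code: it assumes each source row carries either a `week` or a `month` field, and both those field names and the helper `add_cadence` are hypothetical.

```python
from typing import Any, Dict, Optional

Row = Dict[str, Any]


def add_cadence(row: Row) -> Row:
    """Return a copy of row with a cadence value of 'weekly', 'monthly' or None."""
    new_row = dict(row)
    cadence: Optional[str] = None
    # Assumption: weekly rows have a non-empty "week" field, monthly rows a "month" field.
    if row.get("week") not in (None, ""):
        cadence = "weekly"
    elif row.get("month") not in (None, ""):
        cadence = "monthly"
    new_row["cadence"] = cadence
    return new_row
```

Whatever the real mechanism is, filter on cadence before summing or averaging, because weekly and monthly rows cover different time spans.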

US Census

Two additional tables contain population data from the US Census.

This repository includes CSV data for both of these tables.

The latest_ny_times_counties_with_populations view uses this data to calculate cases and deaths per million for US counties, based on the latest county figures from the New York Times.
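The per-million calculation can be illustrated with a small in-memory SQLite database. This is a sketch under stated assumptions: the table names `counties` and `populations`, the view name, the join on fips and all the figures are made up for illustration and do not reproduce the real view definition.

```python
import sqlite3


def build_demo() -> sqlite3.Connection:
    """Create a tiny in-memory database with a per-million view."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        create table counties (fips text, county text, cases integer, deaths integer);
        create table populations (fips text primary key, population integer);
        -- Illustrative numbers only, not real data.
        insert into counties values ('00001', 'Example County', 500, 10);
        insert into populations values ('00001', 2000000);
        create view counties_with_populations as
        select c.fips, c.county, c.cases, c.deaths, p.population,
               1000000.0 * c.cases / p.population as cases_per_million,
               1000000.0 * c.deaths / p.population as deaths_per_million
        from counties c join populations p on c.fips = p.fips;
        """
    )
    return conn


if __name__ == "__main__":
    conn = build_demo()
    print(conn.execute(
        "select county, cases_per_million, deaths_per_million "
        "from counties_with_populations"
    ).fetchall())
```

Multiplying by `1000000.0` (a float) before dividing keeps SQLite from doing integer division, which would truncate small rates to zero.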

Example issues

  • Remember: the number of reported cases is very heavily influenced by the availability of testing.
  • This Twitter thread is an excellent overview of the challenges involved in comparing numbers from different states and countries.
  • On 23rd March 2020 Johns Hopkins added four new columns to the daily CSV file: admin2, fips, active and combined_key. These are not present in older CSV files. #4.
  • Some countries (like Italy) are represented by just the rows with country_or_region set to Italy (and province_or_state set to null). Larger countries such as the United States have multiple rows for each day divided into separate province_or_state values - example.
  • Santa Clara County appears to be represented as Santa Clara, CA in some records and Santa Clara County, CA in others - example.
  • Passengers from the Diamond Princess cruise are represented by a number of different rows with "From Diamond Princess" in their province_or_state column - example.
  • The latest_ny_times_counties_with_populations view omits some counties, notably all New York City counties, because the New York Times groups all New York City data into rows with county equal to "New York City" and an empty fips column. Thus total cases represented in latest_ny_times_counties_with_populations are lower than total cases represented in ny_times_us_states by at least the number of cases in New York City.
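The last point in the list above can be reproduced in miniature. The sketch below assumes the view matches counties to populations with an inner join on fips; the table names and figures are invented for illustration. A row with an empty fips finds no population match and drops out of the joined total.

```python
import sqlite3
from typing import Tuple


def demo_totals() -> Tuple[int, int]:
    """Return (total cases in all rows, total cases surviving a join on fips)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        create table counties (county text, fips text, cases integer);
        create table populations (fips text primary key, population integer);
        -- Illustrative numbers only, not real data.
        insert into counties values ('Example County', '00001', 100);
        insert into counties values ('New York City', '', 900);
        insert into populations values ('00001', 50000);
        """
    )
    all_cases = conn.execute("select sum(cases) from counties").fetchone()[0]
    joined_cases = conn.execute(
        "select sum(c.cases) from counties c join populations p on c.fips = p.fips"
    ).fetchone()[0]
    return all_cases, joined_cases


if __name__ == "__main__":
    all_cases, joined_cases = demo_totals()
    print(f"all rows: {all_cases}, rows with a population match: {joined_cases}")
```

When comparing totals across tables, check for rows with an empty fips first so the gap is not mistaken for missing data.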