
broadinstitute / pyro-cov

License: Apache-2.0
Pyro models of SARS-CoV-2 variants

Programming Languages

Jupyter Notebook
11667 projects
python
139335 projects - #7 most used programming language
TeX
3793 projects
r
7636 projects
Makefile
30231 projects
C++
36643 projects - #6 most used programming language

Projects that are alternatives of or similar to pyro-cov

cli-corona
📈 Track COVID-19 (2019 novel Coronavirus) statistics via the command line.
Stars: ✭ 14 (-64.1%)
Mutual labels:  epidemiology, sars-cov-2
region-plot
A tool to plot significant regions of GWAS
Stars: ✭ 20 (-48.72%)
Mutual labels:  genetics
JUNE
June is a framework for agent based modelling in an epidemiological and geographical context.
Stars: ✭ 28 (-28.21%)
Mutual labels:  epidemiology
graphsim
R package: Simulate Expression data from igraph network using mvtnorm (CRAN; JOSS)
Stars: ✭ 16 (-58.97%)
Mutual labels:  genetics
GeneLab Data Processing
No description or website provided.
Stars: ✭ 32 (-17.95%)
Mutual labels:  genetics
desh-data
Sequence lineage information extracted from RKI sequence data repo
Stars: ✭ 22 (-43.59%)
Mutual labels:  sars-cov-2
poreCov
SARS-CoV-2 workflow for nanopore sequence data
Stars: ✭ 34 (-12.82%)
Mutual labels:  sars-cov-2
SARS2-Stat-KR
Statistics on confirmed cases in Korea of the Wuhan, China virus
Stars: ✭ 25 (-35.9%)
Mutual labels:  sars-cov-2
covid-19-prediction
[IoT'20] Predicting the Growth and Trend of COVID-19 Pandemic using Machine Learning and Cloud Computing
Stars: ✭ 28 (-28.21%)
Mutual labels:  sars-cov-2
serratus
Ultra-deep search for novel viruses
Stars: ✭ 196 (+402.56%)
Mutual labels:  sars-cov-2
SARS-CoV-2-Sequenzdaten aus Deutschland
The Robert Koch Institute provides systems for nationwide molecular surveillance of the SARS-CoV-2 virus. Under the ordinance on molecular genetic surveillance of the coronavirus SARS-CoV-2, every laboratory in Germany that sequences SARS-CoV-2 is required to transmit the sequence data and associated metadata to the Robert Koch Institute…
Stars: ✭ 66 (+69.23%)
Mutual labels:  sars-cov-2
farolcovid
🚦🏥. Monitoring tool & simulation of the risk of collapse in Brazilian municipalities' health system due to Covid-19
Stars: ✭ 49 (+25.64%)
Mutual labels:  epidemiology
dee2
Digital Expression Explorer 2 (DEE2): a repository of uniformly processed RNA-seq data
Stars: ✭ 32 (-17.95%)
Mutual labels:  genetics
SARS-CoV-2-Contextual-Data-Specification
Collection template and associated materials for SARS-CoV-2 metadata
Stars: ✭ 26 (-33.33%)
Mutual labels:  sars-cov-2
HumanIdiogramLibrary
Resource of human chromosome schematics & images
Stars: ✭ 76 (+94.87%)
Mutual labels:  genetics
epidemic-simulator
An HTML/JavaScript simulator for an epidemic in a population
Stars: ✭ 23 (-41.03%)
Mutual labels:  sars-cov-2
AlgebraicPetri.jl
Build Petri net models compositionally
Stars: ✭ 49 (+25.64%)
Mutual labels:  epidemiology
links
Collection of links and information about COVID-19
Stars: ✭ 12 (-69.23%)
Mutual labels:  sars-cov-2
covid
MolSSI SARS-CoV-2 Biomolecular Simulation Data and Algorithm Store
Stars: ✭ 24 (-38.46%)
Mutual labels:  sars-cov-2
COVID19MagyarEpi
Real-time, quantitative epidemiology of the coronavirus epidemic in Hungary.
Stars: ✭ 19 (-51.28%)
Mutual labels:  epidemiology


Pyro models for SARS-CoV-2 analysis

Overview

Supporting material for the paper "Analysis of 6.4 million SARS-CoV-2 genomes identifies mutations associated with fitness" (medRxiv). Figures and supplementary data for that paper are in the paper/ directory.

This code is open source, but we do not intend to support its use by outside groups. To use the outputs of this model, we recommend ingesting the tables strains.tsv and mutations.tsv.
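As a minimal sketch of ingesting those tables, the following Python reads a tab-separated file into a list of dictionaries keyed by its header row. The helper names `read_tsv` and `top_rows` are hypothetical, and the column layout of strains.tsv and mutations.tsv is not described here, so inspect the header before relying on any particular column.

```python
import csv
from pathlib import Path


def read_tsv(path):
    """Read a tab-separated file into a list of dicts keyed by the header row."""
    with Path(path).open(newline="", encoding="utf-8") as f:
        return list(csv.DictReader(f, delimiter="\t"))


def top_rows(rows, column, n=10):
    """Return the n rows with the largest numeric value in `column`."""
    return sorted(rows, key=lambda r: float(r[column]), reverse=True)[:n]


# Example usage (the file location and column name are assumptions):
#   rows = read_tsv("mutations.tsv")
#   print(rows[0].keys())          # inspect the real column names first
#   best = top_rows(rows, "<some numeric column>", n=5)
```

All values are returned as strings, so convert numeric columns explicitly once you know which ones you need.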

Reproducing

Install software

Clone this repository:

git clone git@github.com:broadinstitute/pyro-cov
cd pyro-cov

Install this python package:

pip install -e .

Get access to GISAID data

Work with GISAID to get a data agreement. Define the following environment variables:

GISAID_USERNAME
GISAID_PASSWORD
GISAID_FEED

For example, my username is fritz and my GISAID feed is broad2.
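Before starting a download, it can help to confirm that all three variables are actually set. The check below is a small sketch, not part of the repository; the function name `missing_gisaid_vars` is hypothetical.

```python
import os

# The three variables named in this README.
REQUIRED_VARS = ("GISAID_USERNAME", "GISAID_PASSWORD", "GISAID_FEED")


def missing_gisaid_vars(environ=None):
    """Return the names of required variables that are unset or empty."""
    env = os.environ if environ is None else environ
    return [name for name in REQUIRED_VARS if not env.get(name)]


# Example usage:
#   missing = missing_gisaid_vars()
#   if missing:
#       raise SystemExit("Please set: " + ", ".join(missing))
```

Passing a dictionary instead of reading `os.environ` makes the check easy to exercise without touching real credentials.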

Download data

This downloads data from GISAID and clones repos for other data sources.

make update

Preprocess data

This takes under an hour. Results are cached in the results/ directory, so re-running on newly pulled data should be able to reuse earlier alignment and PANGO lineage classification work.

make preprocess
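The general idea of caching intermediate results on disk can be illustrated as follows. This is only a sketch of the pattern under simple assumptions (one text file per result, keyed by path); it is not the repository's actual caching code, and the name `cached` is hypothetical.

```python
from pathlib import Path


def cached(path, compute):
    """Return the text stored at `path`, computing and saving it on first use."""
    path = Path(path)
    if path.exists():
        # A previous run already produced this result, so reuse it.
        return path.read_text(encoding="utf-8")
    result = compute()
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(result, encoding="utf-8")
    return result


# Example usage (the file name is an assumption):
#   text = cached("results/example.txt", lambda: "expensive output")
```

With this pattern, deleting a cached file forces that one step to be recomputed on the next run.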

Analyze data

make analyze
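The three make targets above run in a fixed order (update, preprocess, analyze). If you prefer to drive them from Python, a thin wrapper might look like the sketch below. The function `run_pipeline` is hypothetical and simply shells out to `make` from the repository root; it assumes `make` is on your PATH.

```python
import subprocess

# Make targets in the order given in this README.
STEPS = ("update", "preprocess", "analyze")


def run_pipeline(steps=STEPS, run=subprocess.run):
    """Run each make target in order, stopping at the first failure."""
    for target in steps:
        # check=True makes subprocess.run raise CalledProcessError on a
        # non-zero exit status, so later steps do not run after a failure.
        run(["make", target], check=True)


# Example usage, from the repository root:
#   run_pipeline()
#   run_pipeline(steps=("preprocess", "analyze"))  # skip the download
```

The `run` parameter exists only so the wrapper can be tested without invoking `make`.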

Generate plots and tables

Plots and tables are generated by running various notebooks.

Citing

If you use this software or the predictions in the paper/ directory, please consider citing:

@article {Obermeyer2021.09.07.21263228,
  author = {Obermeyer, Fritz and
            Schaffner, Stephen F. and
            Jankowiak, Martin and
            Barkas, Nikolaos and
            Pyle, Jesse D. and
            Park, Daniel J. and
            MacInnis, Bronwyn L. and
            Luban, Jeremy and
            Sabeti, Pardis C. and
            Lemieux, Jacob E.},
  title = {Analysis of 2.1 million SARS-CoV-2 genomes identifies mutations associated with transmissibility},
  elocation-id = {2021.09.07.21263228},
  year = {2021},
  doi = {10.1101/2021.09.07.21263228},
  publisher = {Cold Spring Harbor Laboratory Press},
  URL = {https://www.medrxiv.org/content/early/2021/09/13/2021.09.07.21263228},
  eprint = {https://www.medrxiv.org/content/early/2021/09/13/2021.09.07.21263228.full.pdf},
  journal = {medRxiv}
}