All Projects → gagneurlab → drop

gagneurlab / drop

Licence: MIT license
Pipeline to find aberrant events in RNA-Seq data, useful for diagnosis of rare disorders

Programming Languages

python
139335 projects - #7 most used programming language
r
7636 projects
shell
77523 projects

Projects that are alternatives of or similar to drop

rnafusion
RNA-seq analysis pipeline for detection gene-fusions
Stars: ✭ 72 (+4.35%)
Mutual labels:  rna-seq, pipeline
biojupies
Automated generation of tailored bioinformatics Jupyter Notebooks via a user interface.
Stars: ✭ 96 (+39.13%)
Mutual labels:  rna-seq, pipeline
grape-nf
An automated RNA-seq pipeline using Nextflow
Stars: ✭ 30 (-56.52%)
Mutual labels:  rna-seq, pipeline
NGI-RNAseq
Nextflow RNA-Seq Best Practice analysis pipeline, used at the SciLifeLab National Genomics Infrastructure.
Stars: ✭ 50 (-27.54%)
Mutual labels:  rna-seq, pipeline
tailseeker
Software for measuring poly(A) tail length and 3′-end modifications using a high-throughput sequencer
Stars: ✭ 17 (-75.36%)
Mutual labels:  rna-seq, pipeline
RNASeq
RNASeq pipeline
Stars: ✭ 30 (-56.52%)
Mutual labels:  rna-seq, pipeline
dolphinnext
A graphical user interface for distributed data processing of high throughput genomics
Stars: ✭ 92 (+33.33%)
Mutual labels:  rna-seq, pipeline
kallistobustools
kallisto | bustools workflow for pre-processing single-cell RNA-seq data
Stars: ✭ 79 (+14.49%)
Mutual labels:  rna-seq
Pulsar
Open source VFX pipeline tool
Stars: ✭ 20 (-71.01%)
Mutual labels:  pipeline
multibranch-action-triggers-plugin
MultiBranch Actions Trigger Plugin
Stars: ✭ 29 (-57.97%)
Mutual labels:  pipeline
pipeline
Spline is a tool that is capable of running locally as well as part of well known pipelines like Jenkins (Jenkinsfile), Travis CI (.travis.yml) or similar ones.
Stars: ✭ 29 (-57.97%)
Mutual labels:  pipeline
reskit
A library for creating and curating reproducible pipelines for scientific and industrial machine learning
Stars: ✭ 27 (-60.87%)
Mutual labels:  pipeline
plumbing
This repo holds configuration for infrastructure used across the tektoncd org 🏗️
Stars: ✭ 41 (-40.58%)
Mutual labels:  pipeline
PECA
PECA is a software for inferring context specific gene regulatory network from paired gene expression and chromatin accessibility data
Stars: ✭ 31 (-55.07%)
Mutual labels:  rna-seq
GREIN
GREIN : GEO RNA-seq Experiments Interactive Navigator
Stars: ✭ 40 (-42.03%)
Mutual labels:  rna-seq
haystack bio
Haystack: Epigenetic Variability and Transcription Factor Motifs Analysis Pipeline
Stars: ✭ 42 (-39.13%)
Mutual labels:  rna-seq
bookmarks
A PySide2 based file and asset manager for animation and CG productions.
Stars: ✭ 33 (-52.17%)
Mutual labels:  pipeline
nodejs-docker-example
An example of how to run a Node.js project in Docker in a Buildkite pipeline
Stars: ✭ 39 (-43.48%)
Mutual labels:  pipeline
scrnaseq
A single-cell RNAseq pipeline for 10X genomics data
Stars: ✭ 60 (-13.04%)
Mutual labels:  pipeline
eventkit
Event-driven data pipelines
Stars: ✭ 94 (+36.23%)
Mutual labels:  pipeline

Detection of RNA Outlier Pipeline

DROP pipeline status Version Version

The detection of RNA Outliers Pipeline (DROP) is an integrative workflow to detect aberrant expression, aberrant splicing, and mono-allelic expression from raw sequencing files.

The manuscript is available in Nature Protocols. SharedIt link.

drop logo

What's new

Version 1.2.2 fixes some critical bugs that affected the performance of the aberrantExpression pipeline, and allows sample IDs to be numeric.

As of version 1.2.1 DROP has a new module that performs RNA-seq variant calling. The input are BAM files and the output either a single-sample or a multi-sample VCF file (option specified by the user) annotated with allele frequencies from gnomAD (if specified by the user). The sample annotation table does not need to be changed, but several new parameters in the config file have to be added and tuned. For more info, refer to the documentation.

Also, the integration of external split and non-split counts to detect aberrant splicing is now possible. Simply specify in a new column in the sample annotation the directory containing the counts. For more info, refer to the documentation.

Quickstart

DROP is available on bioconda. We recommend using a dedicated conda environment. (installation time: ~ 10min)

mamba create -n drop_env -c conda-forge -c bioconda drop --override-channels

Test installation with demo project

mkdir ~/drop_demo
cd ~/drop_demo
drop demo

The pipeline can be run using snakemake commands

snakemake -n # dryrun
snakemake --cores 1

Expected runtime: 25 min

For more information on different installation options, refer to the documentation

Set up a custom project

Install the drop module according to installation and initialize the project in a custom project directory.

Prepare the input data

Create a sample annotation that contains the sample IDs, file locations and other information necessary for the pipeline. Edit the config file to set the correct file path of sample annotation and locations of non-sample specific input files. The requirements are described in the documentation.

Execute the pipeline

Once these files are set up, you can execute a dry run from your project directory

snakemake -n

This shows you the rules of all subworkflows. Omit -n and specify the number of cores with --cores if you are sure that you want you execute all printed rules. You can also invoke single workflows explicitly e.g. for aberrant expression with:

snakemake aberrantExpression --cores 10

Datasets

The following publicly-available datasets of gene counts can be used as controls. Please cite as instructed for each dataset.

  • 154 non-strand specific fibroblasts, build hg19, Technical University of Munich: DOI

  • 269 strand specific fibroblasts, build hg19, Technical University of Munich: DOI

  • 49 tissues, each containing hundreds of samples, non-strand specific, build hg19, GTEx: DOI

  • 49 tissues, each containing hundreds of samples, non-strand specific, build hg38, GTEx: DOI

  • 139 strand specific fibroblasts, build hg19, Baylor College of Medicine: DOI

  • 125 strand specific blood, build hg19, Baylor College of Medicine: DOI

If you want to contribute with your own count matrices, please contact us: yepez at in.tum.de

Citation

If you use DROP in research, please cite our manuscript.

Furthermore, if you use the aberrant expression module, also cite OUTRIDER; if you use the aberrant splicing module, also cite FRASER; and if you use the MAE module, also cite the Kremer, Bader et al study and DESeq2.

For the complete set of tools used by DROP (e.g. for counting), see the manuscript.

Acknowledgements and Funding

The DROP team is composed of members from the Gagneur lab at the Department of Informatics and School of Medicine of the Technical University of Munich (TUM) and The German Human Genome-Phenome Archive (GHGA). The team has been funded by the German Bundesministerium für Bildung und Forschung (BMBF) through the e:Med Networking fonds AbCD-Net, Medical Informatics Initiative CORD-MI, and ERA PerMed project PerMiM. We would like to thank all the users for their feedback.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].