
GoekeLab / bioinformatics-workflows

Licence: MIT license
minimal example implementations for bioinformatics workflow managers



Reproducible, scalable, and shareable analysis pipelines with bioinformatics workflow managers


Workflow managers provide an easy and intuitive way to simplify pipeline development. Here we provide basic proof-of-concept implementations for selected workflow managers. The analysis workflow is based on a small portion of an RNA-seq pipeline, using fastqc for quality control and salmon for transcript quantification. These implementations are designed as basic illustrations. Workflow managers provide many more powerful features than we use here; please visit their official documentation to explore those in detail.

The RNA-Seq workflow

The RNA-Seq analysis workflow performs quality control with fastqc and quantifies transcript expression using salmon. Here we use local installations (see the documentation for salmon and fastqc). For local installations, you can add symbolic links to the executables in a directory on your $PATH:

sudo ln -s /absolute/path/salmon/bin/salmon /usr/local/bin/salmon

sudo ln -s /absolute/path/FastQC/fastqc /usr/local/bin/fastqc

You can test the installation using the help function of the two tools (i.e. salmon -h and fastqc -h).
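The same check can be scripted before a workflow is started. The following is a minimal sketch, not part of the repository: the helper name find_tools is hypothetical, and it only reports whether each executable resolves on the current $PATH, not whether the tool itself runs correctly.

```python
import shutil


def find_tools(names=("salmon", "fastqc")):
    """Map each tool name to its resolved executable path, or None if it is not on PATH.

    The function name and default tool list are illustrative assumptions.
    """
    return {name: shutil.which(name) for name in names}


if __name__ == "__main__":
    for tool, path in find_tools().items():
        print(f"{tool}: {path or 'NOT FOUND on PATH'}")
```

If a tool is reported as not found, re-check the symbolic links above or the directory they were placed in.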

Test Data

This repository contains a simulated test data set which can be used to run the example implementations. The test data contains RNA-Seq reads (reads_1.fq.gz and reads_2.fq.gz), a transcriptome reference file (transcriptome.fa), and the true counts from the simulation experiments (truth.tsv).
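To make the two workflow steps concrete, the sketch below assembles the fastqc and salmon command lines for these test files without any workflow manager. It is an illustration under assumptions, not the repository's implementation: the function names, the data directory argument, the output directory layout, and the chosen options (automatic library type detection with -l A, thread count with -p) are all hypothetical choices, and the per-manager implementations may use different settings.

```python
import subprocess
from pathlib import Path


def build_commands(data_dir, out_dir="results", threads=1):
    """Return the command lines for QC, indexing, and quantification.

    File names follow the test data described above; directory names
    and option choices are assumptions made for this sketch.
    """
    data = Path(data_dir)
    out = Path(out_dir)
    reads_1 = data / "reads_1.fq.gz"
    reads_2 = data / "reads_2.fq.gz"
    transcriptome = data / "transcriptome.fa"
    index_dir = out / "salmon_index"
    return [
        # Quality control on both read files; fastqc needs the output directory to exist.
        ["fastqc", "-o", str(out / "fastqc"), str(reads_1), str(reads_2)],
        # Build a salmon index from the transcriptome reference.
        ["salmon", "index", "-t", str(transcriptome), "-i", str(index_dir)],
        # Quantify paired-end reads against the index.
        ["salmon", "quant", "-i", str(index_dir), "-l", "A",
         "-1", str(reads_1), "-2", str(reads_2),
         "-p", str(threads), "-o", str(out / "salmon_quant")],
    ]


def run_pipeline(data_dir, out_dir="results", threads=1):
    """Run the three steps in order, stopping at the first failure."""
    (Path(out_dir) / "fastqc").mkdir(parents=True, exist_ok=True)
    for cmd in build_commands(data_dir, out_dir, threads):
        subprocess.run(cmd, check=True)
```

Calling run_pipeline with the path to the test data directory would execute the steps sequentially. A workflow manager adds what this sketch lacks, such as dependency tracking between steps, resumption after failure, and parallel execution.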

Basic proof-of-concept implementations

Each workflow manager folder in this repository has a README detailing how to run the proof-of-concept pipeline. Implementations that are contributed and reviewed by developers from these workflow management systems are marked with (see Acknowledgements):

Online Documentation for Workflow managers

Workflow managers have many more features which are not used in these implementations, and there are many additional workflow managers. You can read more about each workflow manager in their official documentation:

About this repository

The repository was created to illustrate features of workflow managers that are discussed in detail in this manuscript:

Wratten, L., Wilm, A. & Göke, J. Reproducible, scalable, and shareable analysis pipelines with bioinformatics workflow managers. Nat Methods (2021). https://doi.org/10.1038/s41592-021-01254-9 (Full text link: https://rdcu.be/cyjRN)

Contact and Call for Contribution

This repository was created by Laura Wratten. We very much encourage contributions by users of these workflows. If you would like to add an implementation for any of these workflow managers, you can follow the template. If you would like to suggest changes to any of the existing implementations, please raise an issue or submit a pull request.

Acknowledgements

We would like to thank the following people for their contribution to this repository:
