All Projects → aws-samples → aws-genomics-workflows

aws-samples / aws-genomics-workflows

Licence: MIT-0 license
Genomics Workflows on AWS

Programming Languages

shell
77523 projects
typescript
32286 projects
python
139335 projects - #7 most used programming language
Dockerfile
14818 projects
javascript
184084 projects - #8 most used programming language

Projects that are alternatives of or similar to aws-genomics-workflows

workflows
Bioinformatics workflows developed for and used on the St. Jude Cloud project.
Stars: ✭ 16 (-87.79%)
Mutual labels:  genomics, workflows
bodywork-ml-pipeline-project
Deployment template for a continuous training pipeline.
Stars: ✭ 22 (-83.21%)
Mutual labels:  batch
Hap.py
Haplotype VCF comparison tools
Stars: ✭ 249 (+90.08%)
Mutual labels:  genomics
Mitty
Seven Bridges Genomics aligner/caller debugging and analysis tools
Stars: ✭ 13 (-90.08%)
Mutual labels:  genomics
attr-gather
Hit a million different APIs and combine the results in one simple hash (without pulling your hair out). A simple workflow system to gather aggregate attributes for something.
Stars: ✭ 30 (-77.1%)
Mutual labels:  workflows
berokka
🍊 💫 Trim, circularise and orient long read bacterial genome assemblies
Stars: ✭ 23 (-82.44%)
Mutual labels:  genomics
Biopython
Official git repository for Biopython (originally converted from CVS)
Stars: ✭ 2,936 (+2141.22%)
Mutual labels:  genomics
metaRNA
Find target sites for the miRNAs in genomic sequences
Stars: ✭ 19 (-85.5%)
Mutual labels:  genomics
BAT FFMPEG
Batch script files for FFMPEG (Microsoft Windows and DOS, OS/2 🦄)
Stars: ✭ 104 (-20.61%)
Mutual labels:  batch
alfred-workflow-zoom-meetings
Join a Zoom meeting without the link opening in your browser.
Stars: ✭ 45 (-65.65%)
Mutual labels:  workflows
JSBatchobfuscator
JSBatchobfuscator is a simple obfuscator for batch script
Stars: ✭ 47 (-64.12%)
Mutual labels:  batch
fermi
A WGS de novo assembler based on the FMD-index for large genomes
Stars: ✭ 74 (-43.51%)
Mutual labels:  genomics
MGSE
Mapping-based Genome Size Estimation (MGSE) performs an estimation of a genome size based on a read mapping to an existing genome sequence assembly.
Stars: ✭ 22 (-83.21%)
Mutual labels:  genomics
cljam
A DNA Sequence Alignment/Map (SAM) library for Clojure
Stars: ✭ 85 (-35.11%)
Mutual labels:  genomics
GenomicsDB
Highly performant data storage in C++ for importing, querying and transforming variant data with C/C++/Java/Spark bindings. Used in gatk4.
Stars: ✭ 77 (-41.22%)
Mutual labels:  genomics
Canvasxpress
JavaScript VisualizationTools
Stars: ✭ 247 (+88.55%)
Mutual labels:  genomics
HLA
xHLA: Fast and accurate HLA typing from short read sequence data
Stars: ✭ 84 (-35.88%)
Mutual labels:  genomics
kmer-db
Kmer-db is a fast and memory-efficient tool for large-scale k-mer analyses (indexing, querying, estimating evolutionary relationships, etc.).
Stars: ✭ 68 (-48.09%)
Mutual labels:  genomics
spring-batch-rest
REST API for Spring Batch using Spring Boot 2.2
Stars: ✭ 85 (-35.11%)
Mutual labels:  batch
cerebra
A tool for fast and accurate summarizing of variant calling format (VCF) files
Stars: ✭ 55 (-58.02%)
Mutual labels:  genomics

Genomics Workflows on AWS

Build Status

This repository is the source code for Genomics Workflows on AWS. It contains markdown documents that are used to build the site as well as source code (CloudFormation templates, scripts, etc) that can be used to deploy AWS infrastructure for running genomics workflows.

If you want to get the latest version of these solutions up and running quickly, it is recommended that you deploy stacks using the launch buttons available via the hosted guide.

If you want to customize these solutions, you can create your own distribution using the instructions below.

Creating your own distribution

Clone the repo

git clone https://github.com/aws-samples/aws-genomics-workflows.git

Create an S3 bucket in your AWS account to use for the distribution deployment

aws s3 mb <dist-bucketname>

Create and deploy a distribution from source

cd aws-genomics-workflows
bash _scripts/deploy.sh --deploy-region <region> --asset-profile <profile-name> --asset-bucket s3://<dist-bucketname> test

This will create a dist folder in the root of the project with subfolders dist/artifacts and dist/templates that will be uploaded to the S3 bucket you created above.

Use --asset-profile option to specify an AWS profile to use to make the deployment.

Note: the region set for --deploy-region should match the region the bucket <dist-bucketname> is created in.

You can now use your deployed distribution to launch stacks using the AWS CLI. For example, to launch the GWFCore stack:

TEMPLATE_ROOT_URL=https://<dist-bucketname>.s3-<region>.amazonaws.com/test/templates

aws cloudformation create-stack \
    --region <region> \
    --stack-name <stackname> \
    --template-url $TEMPLATE_ROOT_URL/gwfcore/gwfcore-root.template.yaml \
    --capabilities CAPABILITY_IAM CAPABILITY_AUTO_EXPAND \
    --parameters \
        ParameterKey=VpcId,ParameterValue=<vpc-id> \
        ParameterKey=SubnetIds,ParameterValue=\"<subnet-id-1>,<subnet-id-2>,...\" \
        ParameterKey=ArtifactBucketName,ParameterValue=<dist-bucketname> \
        ParameterKey=TemplateRootUrl,ParameterValue=$TEMPLATE_ROOT_URL \
        ParameterKey=S3BucketName,ParameterValue=<store-buketname> \
        ParameterKey=ExistingBucket,ParameterValue=false

Shared File System Support

Amazon EFS is supported out of the box for GWFCore and Nextflow. You have two options to use EFS.

  1. Create a new EFS File System: Be sure to have CreateEFS set to Yes and also include the total number of subnets.
  2. Use an Existing EFS File System: Be sure to specify the EFS ID in the ExistingEFS parameter. This file system should be accessible from every subnet you specify.

Following successful deployment of GWFCore, when creating your Nextflow Resources, set MountEFS to Yes.

Building the documentation

The documentation is built using mkdocs.

Install dependencies:

$ conda env create --file environment.yaml

This will create a conda environment called mkdocs

Build the docs:

$ conda activate mkdocs
$ mkdocs build

License Summary

This library is licensed under the MIT-0 License. See the LICENSE file.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].