tibanna: Tibanna helps you run your genomic pipelines on Amazon cloud (AWS). It is used by the 4DN DCIC (4D Nucleome Data Coordination and Integration Center) to process data. Tibanna supports CWL/WDL (w/ docker), Snakemake (w/ conda) and custom Docker/shell command.
Stars: ✭ 61 (+110.34%)
workflows: Bioinformatics workflows developed for and used on the St. Jude Cloud project.
Stars: ✭ 16 (-44.83%)
Arvados: An open source platform for managing and analyzing biomedical big data
Stars: ✭ 274 (+844.83%)
Scipipe: Robust, flexible and resource-efficient pipelines using Go and the commandline
Stars: ✭ 826 (+2748.28%)
Globalbioticinteractions: Global Biotic Interactions provides access to existing species interaction datasets
Stars: ✭ 71 (+144.83%)
Cuneiform: Cuneiform distributed programming language
Stars: ✭ 175 (+503.45%)
Nextflow: A DSL for data-driven computational pipelines
Stars: ✭ 1,337 (+4510.34%)
TeamTeri: Genomics using open source tools, running on GCP or AWS
Stars: ✭ 30 (+3.45%)
Galaxy: Data intensive science for everyone.
Stars: ✭ 812 (+2700%)
wdl2cwl: [Experimental] Workflow Definition Language (WDL) to CWL
Stars: ✭ 26 (-10.34%)
etlflow: EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for writing various tasks and jobs on GCP and AWS.
Stars: ✭ 38 (+31.03%)
img ai app boilerplate: An image classification app boilerplate to serve your deep learning models asap!
Stars: ✭ 27 (-6.9%)
dysgu: dysgu-SV is a collection of tools for calling structural variants using short or long reads
Stars: ✭ 47 (+62.07%)
CANDO: Computational Analysis of Novel Drug Opportunities
Stars: ✭ 27 (-6.9%)
Scaff10X: Pipeline for scaffolding and breaking a genome assembly using 10x genomics linked-reads
Stars: ✭ 21 (-27.59%)
paccmann datasets: pytoda - PaccMann PyTorch Dataset Classes. Read the docs: https://paccmann.github.io/paccmann_datasets/
Stars: ✭ 15 (-48.28%)
ETL-Starter-Kit: 📁 Extract, Transform, Load (ETL) 👷 refers to a process in database usage and especially in data warehousing. This repository contains a starter kit featuring ETL related work.
Stars: ✭ 21 (-27.59%)
faster lmm d: A faster lmm for GWAS. Supports GPU backend.
Stars: ✭ 12 (-58.62%)
tf-free: Create cloud-native resources on all the major cloud providers, completely free of charge. This project is currently under heavy development.
Stars: ✭ 107 (+268.97%)
sample-sheet: A permissively licensed library designed to replace Illumina's Experiment Manager
Stars: ✭ 42 (+44.83%)
Binning refiner: Improving genome bins through the combination of different binning programs
Stars: ✭ 26 (-10.34%)
sirius: SIRIUS is a software for discovering a landscape of de-novo identification of metabolites using tandem mass spectrometry. This repository contains the code of the SIRIUS Software (GUI and CLI)
Stars: ✭ 32 (+10.34%)
infracost-gh-action: GitHub Action for Infracost. Shows cloud cost estimates for Terraform in pull requests.
Stars: ✭ 119 (+310.34%)
emg-viral-pipeline: VIRify: detection of phages and eukaryotic viruses from metagenomic and metatranscriptomic assemblies
Stars: ✭ 38 (+31.03%)
rocketjob: Ruby's missing background and batch processing system
Stars: ✭ 281 (+868.97%)
awesome-phages: A curated list of phage-related software and computational resources for phage scientists, bioinformaticians and enthusiasts.
Stars: ✭ 14 (-51.72%)
PHAT: Pathogen-Host Analysis Tool - A modern Next-Generation Sequencing (NGS) analysis platform
Stars: ✭ 17 (-41.38%)
gcf-packs: Library packs for Google Cloud Functions
Stars: ✭ 48 (+65.52%)
cwl-ts: TypeScript data model for Common Workflow Language
Stars: ✭ 42 (+44.83%)
protwis: Protwis is the backbone of the GPCRdb. The GPCRdb contains reference data, interactive visualisation and experiment design tools for G protein-coupled receptors (GPCRs).
Stars: ✭ 20 (-31.03%)
ensembl-compara: The Ensembl Compara Perl API and SQL schema
Stars: ✭ 43 (+48.28%)
dtm: A distributed transaction framework that supports multiple languages and the saga, tcc, xa, 2-phase message, and outbox patterns.
Stars: ✭ 6,110 (+20968.97%)
jgi-query: A simple command-line tool to download data from Joint Genome Institute databases
Stars: ✭ 38 (+31.03%)
BridgeDb: The BridgeDb Library source code
Stars: ✭ 22 (-24.14%)
rkmh: Classify sequencing reads using MinHash.
Stars: ✭ 42 (+44.83%)
associate-cloud-engineer: Resources on preparing for Google Cloud Associate Cloud Engineer certification
Stars: ✭ 142 (+389.66%)
gnparser: GNparser normalises scientific names and extracts their semantic elements.
Stars: ✭ 26 (-10.34%)
VIRTUS: A bioinformatics pipeline for viral transcriptome detection and quantification considering splicing.
Stars: ✭ 28 (-3.45%)
zenaton-node: ⚡ Node.js library to run and orchestrate background jobs with Zenaton Workflow Engine
Stars: ✭ 50 (+72.41%)
ipython2cwl: IPython2CWL is a tool for converting IPython Jupyter Notebooks to CWL Command Line Tools by simply providing typing annotations.
Stars: ✭ 15 (-48.28%)
CeleScope: Single Cell Analysis Pipelines
Stars: ✭ 36 (+24.14%)
gcb-visualizer: Cloud Build pipeline visualizer with Graphviz
Stars: ✭ 21 (-27.59%)
gcp-ingestion: Documentation and implementation of telemetry ingestion on Google Cloud Platform
Stars: ✭ 60 (+106.9%)
cloud-build-notifiers: Notifier images for Cloud Build, complete with build status filtering and Google Secret Manager integration
Stars: ✭ 79 (+172.41%)
admixr: An R package for reproducible and automated ADMIXTOOLS analyses
Stars: ✭ 20 (-31.03%)
taska: Workflow Management for Biomedical exploration
Stars: ✭ 29 (+0%)
unimap: An EXPERIMENTAL fork of minimap2 optimized for assembly-to-reference alignment
Stars: ✭ 76 (+162.07%)
30Days-of-GCP: Resources for the 30 Days of GCP program
Stars: ✭ 26 (-10.34%)
PrimerMiner: R-based batch sequence downloader, with primer development and in silico evaluation capabilities
Stars: ✭ 27 (-6.9%)
serverless-ktp-ocr: Serverless Indonesian Identity E-KTP OCR with Google Cloud Platform (GCP) - Cloud Functions, Cloud Storage, and Cloud PubSub
Stars: ✭ 54 (+86.21%)