All Projects → deanmarchiori → analysis-flow

deanmarchiori / analysis-flow

Licence: MIT license
Data Analysis Workflows & Reproducibility Learning Resources

Projects that are alternatives of or similar to analysis-flow

ReproducibleScience
Short course on reproducible science: what, why, how
Stars: ✭ 23 (-78.7%)
Mutual labels:  reproducible-science, reproducibility
papers-as-modules
Software Papers as Software Modules: Towards a Culture of Reusable Results
Stars: ✭ 18 (-83.33%)
Mutual labels:  reproducible-science, reproducibility
researchcompendium
NOTE: This repo is archived. Please see https://github.com/benmarwick/rrtools for my current approach
Stars: ✭ 26 (-75.93%)
Mutual labels:  reproducible-science, reproducibility
r10e-ds-py
Reproducible Data Science in Python (SciPy 2019 Tutorial)
Stars: ✭ 12 (-88.89%)
Mutual labels:  reproducible-science, reproducibility
Vistrails
VisTrails is an open-source data analysis and visualization tool. It provides a comprehensive provenance infrastructure that maintains detailed history information about the steps followed and data derived in the course of an exploratory task: VisTrails maintains provenance of data products, of the computational processes that derive these products and their executions.
Stars: ✭ 94 (-12.96%)
Mutual labels:  reproducible-science, reproducibility
Reproducibilidad
Reproducible Science: what, why, how
Stars: ✭ 39 (-63.89%)
Mutual labels:  reproducible-science, reproducibility
reprozip-examples
Examples and demos for ReproZip
Stars: ✭ 13 (-87.96%)
Mutual labels:  reproducible-science, reproducibility
ukbrest
ukbREST: efficient and streamlined data access for reproducible research of large biobanks
Stars: ✭ 32 (-70.37%)
Mutual labels:  reproducible-science, reproducibility
Rrtools
rrtools: Tools for Writing Reproducible Research in R
Stars: ✭ 508 (+370.37%)
Mutual labels:  reproducible-science, reproducibility
Wdl
Workflow Description Language - Specification and Implementations
Stars: ✭ 438 (+305.56%)
Mutual labels:  reproducible-science, reproducibility
Sacred
Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.
Stars: ✭ 3,678 (+3305.56%)
Mutual labels:  reproducible-science, reproducibility
Reprozip
ReproZip is a tool that simplifies the process of creating reproducible experiments from command-line executions, a frequently-used common denominator in computational science.
Stars: ✭ 231 (+113.89%)
Mutual labels:  reproducible-science, reproducibility
Awesome Reproducible Research
A curated list of reproducible research case studies, projects, tutorials, and media
Stars: ✭ 106 (-1.85%)
Mutual labels:  reproducible-science, reproducibility
hydra-zen
Pythonic functions for creating and enhancing Hydra applications
Stars: ✭ 165 (+52.78%)
Mutual labels:  reproducible-science, reproducibility
fertile
creating optimal conditions for reproducibility
Stars: ✭ 52 (-51.85%)
Mutual labels:  reproducibility
reproducibility-guide
⛔ ARCHIVED ⛔
Stars: ✭ 119 (+10.19%)
Mutual labels:  reproducibility
Go Tooling Workshop
A workshop covering all the tools gophers use in their day to day life
Stars: ✭ 2,683 (+2384.26%)
Mutual labels:  tooling
Bootboot
Dualboot your Ruby app made easy
Stars: ✭ 239 (+121.3%)
Mutual labels:  tooling
Daxif
A framework for automating a lot of xRM development processses. By using simple F# script commands/files one can save a lot of time and effort during this process by using Delegates DAXIF# library.
Stars: ✭ 37 (-65.74%)
Mutual labels:  tooling
benchmark VAE
Unifying Variational Autoencoder (VAE) implementations in Pytorch (NeurIPS 2022)
Stars: ✭ 1,211 (+1021.3%)
Mutual labels:  reproducibility

Data Analysis Workflows & Reproducibility Learning Resources

This repository aims to collect resources relating to workflow and tooling choices that promote reproducibility and best practice in data analysis and data science projects.

The resources have been organised as:

  • R Packages
  • Books
  • Papers
  • Blog Posts
  • Talks and Videos

If you would like to make a contribution, I would be glad to include it. Please file an issue, submit a PR or email me on [email protected]


R Packages

Package About Available on
drake An R-focused pipeline toolkit for reproducibility and high-performance computing CRAN
ProjectTemplate ProjectTemplate is a system for automating the thoughtless parts of a data analysis project CRAN
workflowr A Framework for Reproducible and Collaborative Data Science CRAN
rrtools Tools for Writing Reproducible Research in R Github
orderly Lightweight Reproducible Reporting for R CRAN
fnmate A function definition generator Github
dflow Automatically setup a drake project Github
represtools Basic utility functions to support reproducible research CRAN
starters R Package for initializing projects for various R activities Github
targets Function-oriented Make-like declarative workflows for R Github

Books

Title Authors Year
Agile Data Science with R - A workflow Edwin Thoen 2020
What They Forgot to Teach You About R Jennifer Bryan, Jim Hester 2020
The Turing Way: A Handbook for Reproducible Data Science Becky Arnold, Louise Bowler, Sarah Gibson, Patricia Herterich, Rosie Higman, Kirstie Whitaker 2019

Papers

Title Citation
Packaging Data Analytical Work Reproducibly Using R (and Friends) Ben Marwick, Carl Boettiger & Lincoln Mullen (2018) Packaging Data Analytical Work Reproducibly Using R (and Friends), The American Statistician, 72:1, 80-88, DOI: 10.1080/00031305.2017.1375986
Opinionated analysis development Parker H. 2017. Opinionated analysis development. PeerJ Preprints 5:e3210v1 https://doi.org/10.7287/peerj.preprints.3210v1

Blog Posts


Talks

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].