Top 94 reproducibility open source projects

Jupyterwith
Declarative and reproducible Jupyter environments, powered by Nix
Mach Nix
Create highly reproducible Python environments
Reprozip
ReproZip is a tool that simplifies the process of creating reproducible experiments from command-line executions, a frequently-used common denominator in computational science.
Popper
Container-native task automation engine.
Plynx
PLynx is a domain-agnostic platform for managing reproducible experiments and data-oriented workflows.
Anaconda Project
Tool for encapsulating, running, and reproducing data science projects
Nn Template
Generic template to bootstrap your PyTorch project with PyTorch Lightning, Hydra, W&B, and DVC.
Renku
The Renku Project provides a platform and tools for reproducible and collaborative data analysis.
Accelerator
The Accelerator is a tool for fast and reproducible processing of large amounts of data.
Dna Seq Gatk Variant Calling
This Snakemake pipeline implements the GATK best-practices workflow
Datapackager
An R package to enable reproducible data processing, packaging and sharing.
Reproducibility Guide
Project page for creating a guide to reproducible research
Starters
R Package 📦 for initializing projects for various R activities 🔩
Awesome Reproducible Research
A curated list of reproducible research case studies, projects, tutorials, and media
Vps Comparison
A comparison of several VPS providers. It uses Ansible to run a series of automated benchmark tests on the VPS servers that you specify, so that anyone who wants to compare these results with their own can reproduce the tests. All test results are available in order to provide independence and transparency.
Enmf
This is our implementation of ENMF: Efficient Neural Matrix Factorization (TOIS. 38, 2020). This also provides a fair evaluation of existing state-of-the-art recommendation models.
Reproducible Research
A Reproducible Data Analysis Workflow with R Markdown, Git, Make, and Docker
Vistrails
VisTrails is an open-source data analysis and visualization tool. It provides a comprehensive provenance infrastructure that maintains detailed history information about the steps followed and data derived in the course of an exploratory task: VisTrails maintains provenance of data products, of the computational processes that derive these products and their executions.
Drake
An R-focused pipeline toolkit for reproducibility and high-performance computing
Reproduce Stock Market Direction Random Forests
Reproduce research from paper "Predicting the direction of stock market prices using random forest"
Garage
A toolkit for reproducible reinforcement learning research.
Tensorhub
TensorHub is a library built on top of TensorFlow 2.0 to provide simple, modular and repeatable abstractions to accelerate deep learning research.
Dvc
🦉Data Version Control | Git for Data & Models | ML Experiments Management
Snakemake
This is the development home of the workflow management system Snakemake. For general information, see
Recsys2019 deeplearning evaluation
This is the repository of our article published in RecSys 2019 "Are We Really Making Much Progress? A Worrying Analysis of Recent Neural Recommendation Approaches" and of several follow-up studies.
Reprex
Render bits of R code for sharing, e.g., on GitHub or StackOverflow.
Labnotebook
LabNotebook is a tool that allows you to flexibly monitor, record, save, and query all your machine learning experiments.
Rrtools
rrtools: Tools for Writing Reproducible Research in R
Mimicry
[CVPR 2020 Workshop] A PyTorch GAN library that reproduces research results for popular GANs.
Gtsummary
Presentation-Ready Data Summary and Analytic Result Tables
Wdl
Workflow Description Language - Specification and Implementations
Ck
The Collective Knowledge framework (CK) helps organize black-box research software as a database of reusable components and micro-services with common APIs, automation actions, and extensible meta descriptions. See real-world use cases from Arm, General Motors, ACM, Raspberry Pi Foundation and others:
Datmo
Open source production model management tool for data scientists
Sacred
Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.
ml-project-template
ML project template facilitating both research and production phases.
r10e-ds-py
Reproducible Data Science in Python (SciPy 2019 Tutorial)
reproducible
A set of tools for R that enhance reproducibility beyond package management
scooby
🐶 🕵️ Great Dane turned Python environment detective
git-ghost
Synchronize your working directory efficiently to a remote place without committing the changes.
ten-years
Ten Years Reproducibility Challenge