All Projects → VIDA-NYU → Reprozip

VIDA-NYU / Reprozip

Licence: bsd-3-clause
ReproZip is a tool that simplifies the process of creating reproducible experiments from command-line executions, a frequently-used common denominator in computational science.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Reprozip

reprozip-examples
Examples and demos for ReproZip
Stars: ✭ 13 (-94.37%)
Mutual labels:  reproducible-research, reproducible-science, scientific-computing, reproducibility
researchcompendium
NOTE: This repo is archived. Please see https://github.com/benmarwick/rrtools for my current approach
Stars: ✭ 26 (-88.74%)
Mutual labels:  reproducible-research, reproducible-science, reproducibility
ReproducibleScience
Short course on reproducible science: what, why, how
Stars: ✭ 23 (-90.04%)
Mutual labels:  reproducible-research, reproducible-science, reproducibility
Freud
Powerful, efficient particle trajectory analysis in scientific Python.
Stars: ✭ 118 (-48.92%)
Mutual labels:  hacktoberfest, science, scientific-computing
emp
🔬 Empirical CLI
Stars: ✭ 42 (-81.82%)
Mutual labels:  science, reproducible-research, reproducible-science
ukbrest
ukbREST: efficient and streamlined data access for reproducible research of large biobanks
Stars: ✭ 32 (-86.15%)
Mutual labels:  reproducible-research, reproducible-science, reproducibility
r10e-ds-py
Reproducible Data Science in Python (SciPy 2019 Tutorial)
Stars: ✭ 12 (-94.81%)
Mutual labels:  reproducible-research, reproducible-science, reproducibility
Awesome Reproducible Research
A curated list of reproducible research case studies, projects, tutorials, and media
Stars: ✭ 106 (-54.11%)
Mutual labels:  reproducible-research, reproducibility, reproducible-science
papers-as-modules
Software Papers as Software Modules: Towards a Culture of Reusable Results
Stars: ✭ 18 (-92.21%)
Mutual labels:  reproducible-research, reproducible-science, reproducibility
Sacred
Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.
Stars: ✭ 3,678 (+1492.21%)
Mutual labels:  reproducible-research, reproducibility, reproducible-science
Poliastro
poliastro - 🚀 Astrodynamics in Python
Stars: ✭ 462 (+100%)
Mutual labels:  hacktoberfest, science, scientific-computing
Itk
Insight Toolkit (ITK) -- Official Repository. ITK builds on a proven, spatially-oriented architecture for processing, segmentation, and registration of scientific images in two, three, or more dimensions.
Stars: ✭ 801 (+246.75%)
Mutual labels:  hacktoberfest, scientific-computing, reproducible-research
Rrtools
rrtools: Tools for Writing Reproducible Research in R
Stars: ✭ 508 (+119.91%)
Mutual labels:  reproducible-research, reproducibility, reproducible-science
Analysispreservation.cern.ch
Source code for the CERN Analysis Preservation portal
Stars: ✭ 37 (-83.98%)
Mutual labels:  hacktoberfest, reproducible-research, reproducible-science
Nextflow
A DSL for data-driven computational pipelines
Stars: ✭ 1,337 (+478.79%)
Mutual labels:  reproducible-research, reproducible-science
Everware
Everware is about re-useable science, it allows people to jump right in to your research code.
Stars: ✭ 112 (-51.52%)
Mutual labels:  reproducible-research, reproducible-science
Reproducibility Guide
project page for creating a guide to reproducible research
Stars: ✭ 116 (-49.78%)
Mutual labels:  reproducible-research, reproducibility
Enmf
This is our implementation of ENMF: Efficient Neural Matrix Factorization (TOIS. 38, 2020). This also provides a fair evaluation of existing state-of-the-art recommendation models.
Stars: ✭ 96 (-58.44%)
Mutual labels:  reproducible-research, reproducibility
Tardis
TARDIS - Temperature And Radiative Diffusion In Supernovae
Stars: ✭ 112 (-51.52%)
Mutual labels:  hacktoberfest, science
Steppy
Lightweight, Python library for fast and reproducible experimentation 🔬
Stars: ✭ 119 (-48.48%)
Mutual labels:  reproducible-research, reproducibility

Build Status Coverage Status Documentation Status Matrix Say Thanks! status DOI

ReproZip

ReproZip is a tool aimed at simplifying the process of creating reproducible experiments from command-line executions, a frequently-used common denominator in computational science.

It tracks operating system calls and creates a package that contains all the binaries, files and dependencies required to run a given command on the author's computational environment (packing step). A reviewer can then extract the experiment in his environment to reproduce the results (unpacking step).

Quickstart

We have an example repository with a variety of different software. Don't hesitate to check it out, and contribute your own example if use ReproZip for something new!

Packing

Packing experiments is only available for Linux distributions. In the environment where the experiment is originally executed, first install reprozip:

$ pip install reprozip

Then, run your experiment with reprozip. Suppose you execute your experiment by originally running the following command:

$ ./myexperiment -my --options inputs/somefile.csv other_file_here.bin

To run it with reprozip, you just need to use the prefix reprozip trace:

$ reprozip trace ./myexperiment -my --options inputs/somefile.csv other_file_here.bin

This command creates a .reprozip-trace directory, in which you'll find the configuration file, named config.yml. You can edit the command line and environment variables, and choose which files to pack.

If you are using Debian or Ubuntu, most of these files (library dependencies) are organized by package. You can add or remove files, or choose not to include a package by changing option packfiles from true to false. In this way, smaller packs can be created with reprozip (if space is an issue), and reprounzip can download these files from the package manager; however, note this is only available for Debian and Ubuntu for now, and also be aware that package versions might differ. Choosing which files to pack is also important to remove sensitive information and third-party software that is not open source and should not be distributed.

Once done editing the configuration file (or even if you did not change anything), run the following command to create a ReproZip package named my_experiment:

$ reprozip pack my_experiment.rpz

Voilà! Now your experiment has been packed, and you can send it to your collaborators, reviewers, and researchers around the world!

Note that you can open the help message for any reprozip command by using the flag -h.

Unpacking

Do you need to unpack an experiment in a Linux machine? Easy! First, install reprounzip:

$ pip install reprounzip

Then, if you want to unpack everything in a single directory named mydirectory and execute the experiment from there, use the prefix reprounzip directory:

$ reprounzip directory setup my_experiment.rpz mydirectory
$ reprounzip directory run mydirectory

In case you prefer to build a chroot environment under mychroot, use the prefix reprounzip chroot:

$ reprounzip chroot setup my_experiment.rpz mychroot
$ reprounzip chroot run mychroot

Note that the previous options do not interfere with the original configuration of the environment, so don't worry! If you are using Debian or Ubuntu, reprounzip also has an option to install all the library dependencies directly on the machine using package managers (rather than just copying the files from the .rpz package). Be aware that this will interfere in your environment and it may update your library packages, so use it at your own risk! For this option, just use the prefix reprounzip installpkgs:

$ reprounzip installpkgs my_experiment.rpz

What if you want to reproduce the experiment in Windows or Mac OS X? You can build a virtual machine with the experiment! Easy as well! First, install the plugin reprounzip-vagrant:

$ pip install reprounzip-vagrant

Note that (i) you must install reprounzip first, and (ii) the plugin requires having Vagrant installed. Then, use the prefix reprounzip vagrant to create and start a virtual machine under directory mytemplate:

$ reprounzip vagrant setup my_experiment.rpz mytemplate

To execute the experiment, simply run:

$ reprounzip vagrant run mytemplate

Alternatively, you may use Docker containers to reproduce the experiment, which also works under Linux, Mac OS X, and Windows! First, install the plugin reprounzip-docker:

$ pip install reprounzip-docker

Then, assuming that you want to create the container under directory mytemplate, simply use the prefix reprounzip docker:

$ reprounzip docker setup my_experiment.rpz mytemplate
$ reprounzip docker run mytemplate

Remember that you can open the help message and learn more about other available flags and options by using the flag -h for any reprounzip command.

Citing ReproZip

Please use the following when citing ReproZip (BibTeX):

ReproZip: Computational Reproducibility With Ease
F. Chirigati, R. Rampin, D. Shasha, and J. Freire.
In Proceedings of the 2016 ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 2085-2088, 2016

Contribute

Please subscribe to and contact the [email protected] mailing list for questions, suggestions and discussions about using reprozip.

Use the [email protected] mailing list for any questions about reprozip source code.

Bugs and feature plannings are tracked in the GitHub issues. Feel free to add an issue!

To suggest changes to this source code, feel free to raise a GitHub pull request. Any contributions received are assumed to be covered by the BSD 3-Clause license. We might ask you to sign a Contributor License Agreement before accepting a larger contribution.

License

  • Copyright (C) 2014-2017, New York University

Licensed under a BSD 3-Clause license. See the file LICENSE.txt for details.

Links and References

For more detailed information, please refer to our website, as well as to our documentation.

ReproZip is currently being developed at NYU. The team includes:

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].