All Projects → mllg → Batchtools

mllg / Batchtools

Licence: lgpl-3.0
Tools for computation on batch systems

Programming Languages

r
7636 projects

Projects that are alternatives of or similar to Batchtools

Dash
DASH, the C++ Template Library for Distributed Data Structures with Support for Hierarchical Locality for HPC and Data-Driven Science
Stars: ✭ 134 (+5.51%)
Mutual labels:  parallel-computing, hpc, high-performance-computing
Future
🚀 R package: future: Unified Parallel and Distributed Processing in R for Everyone
Stars: ✭ 735 (+478.74%)
Mutual labels:  parallel-computing, cran, hpc
Sundials
SUNDIALS is a SUite of Nonlinear and DIfferential/ALgebraic equation Solvers. This is a mirror of current releases, and development will move here eventually. Pull requests are welcome for bug fixes and minor changes.
Stars: ✭ 194 (+52.76%)
Mutual labels:  parallel-computing, hpc, high-performance-computing
Mfem
Lightweight, general, scalable C++ library for finite element methods
Stars: ✭ 667 (+425.2%)
Mutual labels:  parallel-computing, hpc, high-performance-computing
Opencoarrays
A parallel application binary interface for Fortran 2018 compilers.
Stars: ✭ 151 (+18.9%)
Mutual labels:  parallel-computing, hpc, high-performance-computing
t8code
Parallel algorithms and data structures for tree-based AMR with arbitrary element shapes.
Stars: ✭ 37 (-70.87%)
Mutual labels:  hpc, parallel-computing, high-performance-computing
ParallelUtilities.jl
Fast and easy parallel mapreduce on HPC clusters
Stars: ✭ 28 (-77.95%)
Mutual labels:  hpc, parallel-computing, high-performance-computing
Jug
Parallel programming with Python
Stars: ✭ 337 (+165.35%)
Mutual labels:  parallel-computing, hpc
Taskflow
A General-purpose Parallel and Heterogeneous Task Programming System
Stars: ✭ 6,128 (+4725.2%)
Mutual labels:  parallel-computing, high-performance-computing
Core
parallel finite element unstructured meshes
Stars: ✭ 124 (-2.36%)
Mutual labels:  parallel-computing, hpc
Kokkos
Kokkos C++ Performance Portability Programming EcoSystem: The Programming Model - Parallel Execution and Memory Abstraction
Stars: ✭ 744 (+485.83%)
Mutual labels:  parallel-computing, high-performance-computing
Arraymancer
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+524.41%)
Mutual labels:  parallel-computing, high-performance-computing
Sos
Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric Interface (OFI), and UCX. Please click on the Wiki tab for help with building and using SOS.
Stars: ✭ 34 (-73.23%)
Mutual labels:  parallel-computing, hpc
Pyhpc Benchmarks
A suite of benchmarks to test the sequential CPU and GPU performance of most popular high-performance libraries for Python.
Stars: ✭ 119 (-6.3%)
Mutual labels:  parallel-computing, high-performance-computing
Targets
Function-oriented Make-like declarative workflows for R
Stars: ✭ 293 (+130.71%)
Mutual labels:  reproducibility, high-performance-computing
Graphit
GraphIt - A High-Performance Domain Specific Language for Graph Analytics
Stars: ✭ 254 (+100%)
Mutual labels:  parallel-computing, high-performance-computing
MOT
Multi-threaded Optimization Toolbox
Stars: ✭ 28 (-77.95%)
Mutual labels:  parallel-computing, high-performance-computing
Onemkl
oneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (-3.94%)
Mutual labels:  parallel-computing, hpc
Drake Examples
Example workflows for the drake R package
Stars: ✭ 57 (-55.12%)
Mutual labels:  reproducibility, high-performance-computing
Geopm
Global Extensible Open Power Manager
Stars: ✭ 57 (-55.12%)
Mutual labels:  hpc, high-performance-computing

batchtools

JOSS Publicatoin CRAN Status Badge

As a successor of the packages BatchJobs and BatchExperiments, batchtools provides a parallel implementation of Map for high performance computing systems managed by schedulers like Slurm, Sun Grid Engine, OpenLava, TORQUE/OpenPBS, Load Sharing Facility (LSF) or Docker Swarm (see the setup section in the vignette).

Main features:

  • Convenience: All relevant batch system operations (submitting, listing, killing) are either handled internally or abstracted via simple R functions
  • Portability: With a well-defined interface, the source is independent from the underlying batch system - prototype locally, deploy on any high performance cluster
  • Reproducibility: Every computational part has an associated seed stored in a data base which ensures reproducibility even when the underlying batch system changes
  • Abstraction: The code layers for algorithms, experiment definitions and execution are cleanly separated and allow to write readable and maintainable code to manage large scale computer experiments

Installation

Install the stable version from CRAN:

install.packages("batchtools")

For the development version, use devtools:

devtools::install_github("mllg/batchtools")

Next, you need to setup batchtools for your HPC (it will run sequentially otherwise). See the vignette for instructions.

Why batchtools?

The development of BatchJobs and BatchExperiments is discontinued for the following reasons:

  • Maintainability: The packages BatchJobs and BatchExperiments are tightly connected which makes maintenance difficult. Changes have to be synchronized and tested against the current CRAN versions for compatibility. Furthermore, BatchExperiments violates CRAN policies by calling internal functions of BatchJobs.
  • Data base issues: Although we invested weeks to mitigate issues with locks of the SQLite data base or file system (staged queries, file system timeouts, ...), BatchJobs kept working unreliable on some systems with high latency under certain conditions. This made BatchJobs unusable for many users.

BatchJobs and BatchExperiments will remain on CRAN, but new features are unlikely to be ported back. The vignette contains a section comparing the packages.

Resources

Citation

Please cite the JOSS paper using the following BibTeX entry:

@article{,
  doi = {10.21105/joss.00135},
  url = {https://doi.org/10.21105/joss.00135},
  year  = {2017},
  month = {feb},
  publisher = {The Open Journal},
  volume = {2},
  number = {10},
  author = {Michel Lang and Bernd Bischl and Dirk Surmann},
  title = {batchtools: Tools for R to work on batch systems},
  journal = {The Journal of Open Source Software}
}

Related Software

  • The High Performance Computing Task View lists the most relevant packages for scientific computing with R.
  • clustermq is a similar approach which also supports multiple schedulers. Uses the ZeroMQ network protocol for communication, and shines if you have millions of fast jobs.
  • batch assists in splitting and submitting jobs to LSF and MOSIX clusters.
  • flowr supports LSF, Slurm, TORQUE and Moab and provides a scatter-gather approach to define computational jobs.
  • future.batchtools implements batchtools as backend for future.
  • doFuture together with future.batchtools connects batchtools to foreach.
  • drake uses graphs to define computational jobs. batchtools is used as a backend via future.batchtools.

Contributing to batchtools

This R package is licensed under the LGPL-3. If you encounter problems using this software (lack of documentation, misleading or wrong documentation, unexpected behaviour, bugs, ...) or just want to suggest features, please open an issue in the issue tracker. Pull requests are welcome and will be included at the discretion of the author. If you have customized a template file for your (larger) computing site, please share it: fork the repository, place your template in inst/templates and send a pull request.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].