ATOMOAtomo: Communication-efficient Learning via Atomic Sparsification
Stars: ✭ 23 (+64.29%)
arborThe Arbor multi-compartment neural network simulation library.
Stars: ✭ 87 (+521.43%)
Abyss🔬 Assemble large genomes using short reads
Stars: ✭ 219 (+1464.29%)
TomsfastmathTomsFastMath is a fast public domain, open source, large integer arithmetic library written in portable ISO C.
Stars: ✭ 169 (+1107.14%)
azurehpcThis repository provides easy automation scripts for building a HPC environment in Azure. It also includes examples to build e2e environment and run some of the key HPC benchmarks and applications.
Stars: ✭ 102 (+628.57%)
GalaxyGalaxy is an asynchronous parallel visualization ray tracer for performant rendering in distributed computing environments. Galaxy builds upon Intel OSPRay and Intel Embree, including ray queueing and sending logic inspired by TACC GraviT.
Stars: ✭ 18 (+28.57%)
Primecount🚀 Fast prime counting function implementations
Stars: ✭ 193 (+1278.57%)
XH5ForXDMF parallel partitioned mesh I/O on top of HDF5
Stars: ✭ 23 (+64.29%)
Theano-MPIMPI Parallel framework for training deep learning models built in Theano
Stars: ✭ 55 (+292.86%)
Coreparallel finite element unstructured meshes
Stars: ✭ 124 (+785.71%)
ParamonteParaMonte: Plain Powerful Parallel Monte Carlo and MCMC Library for Python, MATLAB, Fortran, C++, C.
Stars: ✭ 88 (+528.57%)
Foundations of HPC 2021This repository collects the materials from the course "Foundations of HPC", 2021, at the Data Science and Scientific Computing Department, University of Trieste
Stars: ✭ 22 (+57.14%)
raptorGeneral, high performance algebraic multigrid solver
Stars: ✭ 50 (+257.14%)
Batch ShipyardSimplify HPC and Batch workloads on Azure
Stars: ✭ 240 (+1614.29%)
FluxUtils.jlSklearn Interface and Distributed Training for Flux.jl
Stars: ✭ 12 (-14.29%)
Raxml NgRAxML Next Generation: faster, easier-to-use and more flexible
Stars: ✭ 191 (+1264.29%)
mpiBenchMPI benchmark to test and measure collective performance
Stars: ✭ 39 (+178.57%)
HpcinfoInformation about many aspects of high-performance computing. Wiki content moved to ~/docs.
Stars: ✭ 171 (+1121.43%)
h5fortran-mpiHDF5-MPI parallel Fortran object-oriented interface
Stars: ✭ 15 (+7.14%)
HorovodDistributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Stars: ✭ 11,943 (+85207.14%)
PfunitParallel Fortran Unit Testing Framework
Stars: ✭ 104 (+642.86%)
faabricMessaging and state layer for distributed serverless applications
Stars: ✭ 39 (+178.57%)
az-hopThe Azure HPC On-Demand Platform provides an HPC Cluster Ready solution
Stars: ✭ 33 (+135.71%)
SchwimmbadA common interface to processing pools.
Stars: ✭ 82 (+485.71%)
HiopHPC solver for nonlinear optimization problems
Stars: ✭ 75 (+435.71%)
api-specAPI Specififications
Stars: ✭ 30 (+114.29%)
SIRIUSDomain specific library for electronic structure calculations
Stars: ✭ 87 (+521.43%)
hp2pHeavy Peer To Peer: a MPI based benchmark for network diagnostic
Stars: ✭ 17 (+21.43%)
nbodykitAnalysis kit for large-scale structure datasets, the massively parallel way
Stars: ✭ 93 (+564.29%)
Aff3ctA fast simulator and a library dedicated to the channel coding.
Stars: ✭ 240 (+1614.29%)
mlsCSCE 585 - Machine Learning Systems
Stars: ✭ 36 (+157.14%)
DmtcpDMTCP: Distributed MultiThreaded CheckPointing
Stars: ✭ 229 (+1535.71%)
fmlFused Matrix Library
Stars: ✭ 24 (+71.43%)
Mpi.jlMPI wrappers for Julia
Stars: ✭ 197 (+1307.14%)
hpcLearning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )
Stars: ✭ 39 (+178.57%)
TimemoryModular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template API is essentially a framework to creating tools: it is designed to provide a unifying interface for recording various performance measurements alongside data logging and interfaces to other tools.
Stars: ✭ 192 (+1271.43%)
SWCaffeA Deep Learning Framework customized for Sunway TaihuLight
Stars: ✭ 37 (+164.29%)
Mpi OperatorKubernetes Operator for Allreduce-style Distributed Training
Stars: ✭ 190 (+1257.14%)
scrSCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability for MPI codes.
Stars: ✭ 84 (+500%)
Libgrape Lite🍇 A C++ library for parallel graph processing 🍇
Stars: ✭ 169 (+1107.14%)
pbdMLNo description or website provided.
Stars: ✭ 13 (-7.14%)
QudaQUDA is a library for performing calculations in lattice QCD on GPUs.
Stars: ✭ 166 (+1085.71%)
alsvinnThe fast Finite Volume simulator with UQ support.
Stars: ✭ 22 (+57.14%)
DashDASH, the C++ Template Library for Distributed Data Structures with Support for Hierarchical Locality for HPC and Data-Driven Science
Stars: ✭ 134 (+857.14%)
EDLibExact diagonalization solver for quantum electron models
Stars: ✭ 18 (+28.57%)
MatexMachine Learning Toolkit for Extreme Scale (MaTEx)
Stars: ✭ 104 (+642.86%)
ravelRavel MPI trace visualization tool
Stars: ✭ 26 (+85.71%)
Ytk Mp4jYtk-mp4j is a fast, user-friendly, cross-platform, multi-process, multi-thread collective message passing java library which includes gather, scatter, allgather, reduce-scatter, broadcast, reduce, allreduce communications for distributed machine learning.
Stars: ✭ 102 (+628.57%)
bsuir-csn-cmsn-helperRepository containing ready-made laboratory works in the specialty of computing machines, systems and networks
Stars: ✭ 43 (+207.14%)
Training MaterialA collection of code examples as well as presentations for training purposes
Stars: ✭ 85 (+507.14%)
t8codeParallel algorithms and data structures for tree-based AMR with arbitrary element shapes.
Stars: ✭ 37 (+164.29%)
OmpiOpen MPI main development repository
Stars: ✭ 1,221 (+8621.43%)
sst-coreSST Structural Simulation Toolkit Parallel Discrete Event Core and Services
Stars: ✭ 82 (+485.71%)
Pismrepository for the Parallel Ice Sheet Model (PISM)
Stars: ✭ 61 (+335.71%)
ParMmgDistributed parallelization of 3D volume mesh adaptation
Stars: ✭ 19 (+35.71%)
ACCLAccelerated Collective Communication Library: MPI-like communication operations for Xilinx Alveo accelerators
Stars: ✭ 28 (+100%)
fdtd3dfdtd3d is an open source 1D, 2D, 3D FDTD electromagnetics solver with MPI, OpenMP and CUDA support for x86, arm, arm64 architectures
Stars: ✭ 77 (+450%)
sboxgatesProgram for finding low gate count implementations of S-boxes.
Stars: ✭ 30 (+114.29%)
gslibsparse communication library
Stars: ✭ 22 (+57.14%)