Batch Shipyard
Simplify HPC and Batch workloads on Azure
Stars: ✭ 240 (+1311.76%)
Hpcinfo
Information about many aspects of high-performance computing. Wiki content moved to ~/docs.
Stars: ✭ 171 (+905.88%)
mpiBench
MPI benchmark to test and measure collective performance
Stars: ✭ 39 (+129.41%)
Foundations of HPC 2021
This repository collects the materials from the course "Foundations of HPC", 2021, at the Data Science and Scientific Computing Department, University of Trieste
Stars: ✭ 22 (+29.41%)
Pfunit
Parallel Fortran Unit Testing Framework
Stars: ✭ 104 (+511.76%)
SIRIUS
Domain specific library for electronic structure calculations
Stars: ✭ 87 (+411.76%)
Raxml Ng
RAxML Next Generation: faster, easier-to-use and more flexible
Stars: ✭ 191 (+1023.53%)
nbodykit
Analysis kit for large-scale structure datasets, the massively parallel way
Stars: ✭ 93 (+447.06%)
Horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Stars: ✭ 11,943 (+70152.94%)
GenomicsDB
Highly performant data storage in C++ for importing, querying and transforming variant data with C/C++/Java/Spark bindings. Used in gatk4.
Stars: ✭ 77 (+352.94%)
Schwimmbad
A common interface to processing pools.
Stars: ✭ 82 (+382.35%)
sst-core
SST Structural Simulation Toolkit Parallel Discrete Event Core and Services
Stars: ✭ 82 (+382.35%)
azurehpc
This repository provides easy automation scripts for building an HPC environment in Azure. It also includes examples for building an end-to-end environment and running some of the key HPC benchmarks and applications.
Stars: ✭ 102 (+500%)
bsuir-csn-cmsn-helper
Repository containing ready-made laboratory works in the specialty of computing machines, systems and networks
Stars: ✭ 43 (+152.94%)
Abyss
🔬 Assemble large genomes using short reads
Stars: ✭ 219 (+1188.24%)
Galaxy
Galaxy is an asynchronous parallel visualization ray tracer for performant rendering in distributed computing environments. Galaxy builds upon Intel OSPRay and Intel Embree, including ray queueing and sending logic inspired by TACC GraviT.
Stars: ✭ 18 (+5.88%)
Primecount
🚀 Fast prime counting function implementations
Stars: ✭ 193 (+1035.29%)
pbdML
No description or website provided.
Stars: ✭ 13 (-23.53%)
Tomsfastmath
TomsFastMath is a fast public domain, open source, large integer arithmetic library written in portable ISO C.
Stars: ✭ 169 (+894.12%)
arbor
The Arbor multi-compartment neural network simulation library.
Stars: ✭ 87 (+411.76%)
Core
parallel finite element unstructured meshes
Stars: ✭ 124 (+629.41%)
SWCaffe
A Deep Learning Framework customized for Sunway TaihuLight
Stars: ✭ 37 (+117.65%)
Paramonte
ParaMonte: Plain Powerful Parallel Monte Carlo and MCMC Library for Python, MATLAB, Fortran, C++, C.
Stars: ✭ 88 (+417.65%)
Theano-MPI
MPI Parallel framework for training deep learning models built in Theano
Stars: ✭ 55 (+223.53%)
ParMmg
Distributed parallelization of 3D volume mesh adaptation
Stars: ✭ 19 (+11.76%)
Ompi
Open MPI main development repository
Stars: ✭ 1,221 (+7082.35%)
faabric
Messaging and state layer for distributed serverless applications
Stars: ✭ 39 (+129.41%)
api-spec
API Specifications
Stars: ✭ 30 (+76.47%)
h5fortran-mpi
HDF5-MPI parallel Fortran object-oriented interface
Stars: ✭ 15 (-11.76%)
hp2p
Heavy Peer To Peer: an MPI-based benchmark for network diagnostics
Stars: ✭ 17 (+0%)
gslib
sparse communication library
Stars: ✭ 22 (+29.41%)
Aff3ct
A fast simulator and a library dedicated to the channel coding.
Stars: ✭ 240 (+1311.76%)
ACCL
Accelerated Collective Communication Library: MPI-like communication operations for Xilinx Alveo accelerators
Stars: ✭ 28 (+64.71%)
Dmtcp
DMTCP: Distributed MultiThreaded CheckPointing
Stars: ✭ 229 (+1247.06%)
raptor
General, high performance algebraic multigrid solver
Stars: ✭ 50 (+194.12%)
Mpi.jl
MPI wrappers for Julia
Stars: ✭ 197 (+1058.82%)
sboxgates
Program for finding low gate count implementations of S-boxes.
Stars: ✭ 30 (+76.47%)
Timemory
Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template API is essentially a framework for creating tools: it is designed to provide a unifying interface for recording various performance measurements alongside data logging and interfaces to other tools.
Stars: ✭ 192 (+1029.41%)
hpc
Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc.)
Stars: ✭ 39 (+129.41%)
Mpi Operator
Kubernetes Operator for Allreduce-style Distributed Training
Stars: ✭ 190 (+1017.65%)
libquo
Dynamic execution environments for coupled, thread-heterogeneous MPI+X applications
Stars: ✭ 21 (+23.53%)
Libgrape Lite
🍇 A C++ library for parallel graph processing 🍇
Stars: ✭ 169 (+894.12%)
scr
SCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability for MPI codes.
Stars: ✭ 84 (+394.12%)
Quda
QUDA is a library for performing calculations in lattice QCD on GPUs.
Stars: ✭ 166 (+876.47%)
FluxUtils.jl
Sklearn Interface and Distributed Training for Flux.jl
Stars: ✭ 12 (-29.41%)
Dash
DASH, the C++ Template Library for Distributed Data Structures with Support for Hierarchical Locality for HPC and Data-Driven Science
Stars: ✭ 134 (+688.24%)
alsvinn
The fast Finite Volume simulator with UQ support.
Stars: ✭ 22 (+29.41%)
Matex
Machine Learning Toolkit for Extreme Scale (MaTEx)
Stars: ✭ 104 (+511.76%)
fdtd3d
fdtd3d is an open source 1D, 2D, 3D FDTD electromagnetics solver with MPI, OpenMP and CUDA support for x86, arm, arm64 architectures
Stars: ✭ 77 (+352.94%)
Ytk Mp4j
Ytk-mp4j is a fast, user-friendly, cross-platform, multi-process, multi-thread collective message passing Java library which includes gather, scatter, allgather, reduce-scatter, broadcast, reduce, allreduce communications for distributed machine learning.
Stars: ✭ 102 (+500%)
ravel
Ravel MPI trace visualization tool
Stars: ✭ 26 (+52.94%)
Training Material
A collection of code examples as well as presentations for training purposes
Stars: ✭ 85 (+400%)
XH5For
XDMF parallel partitioned mesh I/O on top of HDF5
Stars: ✭ 23 (+35.29%)
t8code
Parallel algorithms and data structures for tree-based AMR with arbitrary element shapes.
Stars: ✭ 37 (+117.65%)
matrix multiplication
Parallel Matrix Multiplication Using OpenMP, Pthreads, and MPI
Stars: ✭ 41 (+141.18%)
eventgrad
Event-Triggered Communication in Parallel Machine Learning
Stars: ✭ 14 (-17.65%)
fml
Fused Matrix Library
Stars: ✭ 24 (+41.18%)
EDLib
Exact diagonalization solver for quantum electron models
Stars: ✭ 18 (+5.88%)
az-hop
The Azure HPC On-Demand Platform provides an HPC Cluster Ready solution
Stars: ✭ 33 (+94.12%)