ArraymancerA fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+5564.29%)
OccaJIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal
Stars: ✭ 230 (+1542.86%)
ParenchymaAn extensible HPC framework for CUDA, OpenCL and native CPU.
Stars: ✭ 71 (+407.14%)
monolishmonolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
Stars: ✭ 166 (+1085.71%)
OnemkloneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (+771.43%)
Foundations of HPC 2021This repository collects the materials from the course "Foundations of HPC", 2021, at the Data Science and Scientific Computing Department, University of Trieste
Stars: ✭ 22 (+57.14%)
SundialsSUNDIALS is a SUite of Nonlinear and DIfferential/ALgebraic equation Solvers. This is a mirror of current releases, and development will move here eventually. Pull requests are welcome for bug fixes and minor changes.
Stars: ✭ 194 (+1285.71%)
HiopHPC solver for nonlinear optimization problems
Stars: ✭ 75 (+435.71%)
Ctranslate2Fast inference engine for OpenNMT models
Stars: ✭ 140 (+900%)
allgebraBase container for developing C++ and Fortran HPC applications
Stars: ✭ 14 (+0%)
gpubootcampThis repository consists for gpu bootcamp material for HPC and AI
Stars: ✭ 227 (+1521.43%)
Arrayfire PythonPython bindings for ArrayFire: A general purpose GPU library.
Stars: ✭ 358 (+2457.14%)
JugParallel programming with Python
Stars: ✭ 337 (+2307.14%)
SosSandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric Interface (OFI), and UCX. Please click on the Wiki tab for help with building and using SOS.
Stars: ✭ 34 (+142.86%)
BabelstreamSTREAM, for lots of devices written in many programming models
Stars: ✭ 121 (+764.29%)
MfemLightweight, general, scalable C++ library for finite element methods
Stars: ✭ 667 (+4664.29%)
Training MaterialA collection of code examples as well as presentations for training purposes
Stars: ✭ 85 (+507.14%)
BatchtoolsTools for computation on batch systems
Stars: ✭ 127 (+807.14%)
DashDASH, the C++ Template Library for Distributed Data Structures with Support for Hierarchical Locality for HPC and Data-Driven Science
Stars: ✭ 134 (+857.14%)
GinkgoNumerical linear algebra software package
Stars: ✭ 149 (+964.29%)
SamraiStructured Adaptive Mesh Refinement Application Infrastructure - a scalable C++ framework for block-structured AMR application development
Stars: ✭ 160 (+1042.86%)
CharmThe Charm++ parallel programming system. Visit https://charmplusplus.org/ for more information.
Stars: ✭ 96 (+585.71%)
OpencoarraysA parallel application binary interface for Fortran 2018 compilers.
Stars: ✭ 151 (+978.57%)
RelionImage-processing software for cryo-electron microscopy
Stars: ✭ 219 (+1464.29%)
MatXAn efficient C++17 GPU numerical computing library with Python-like syntax
Stars: ✭ 418 (+2885.71%)
ArrayfireArrayFire: a general purpose GPU library.
Stars: ✭ 3,693 (+26278.57%)
boltOfficial BOLT Repository
Stars: ✭ 19 (+35.71%)
Armadillo CodeArmadillo: fast C++ library for linear algebra & scientific computing - http://arma.sourceforge.net
Stars: ✭ 388 (+2671.43%)
pyccelPython extension language using accelerators
Stars: ✭ 189 (+1250%)
KttKernel Tuning Toolkit
Stars: ✭ 33 (+135.71%)
Future🚀 R package: future: Unified Parallel and Distributed Processing in R for Everyone
Stars: ✭ 735 (+5150%)
gardeniaGARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators
Stars: ✭ 22 (+57.14%)
cuda memtestFork of CUDA GPU memtest 👓
Stars: ✭ 68 (+385.71%)
Coreparallel finite element unstructured meshes
Stars: ✭ 124 (+785.71%)
cruiseUser space POSIX-like file system in main memory
Stars: ✭ 27 (+92.86%)
NsimdAgenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (+885.71%)
Futhark💥💻💥 A data-parallel functional programming language
Stars: ✭ 1,641 (+11621.43%)
Future.apply🚀 R package: future.apply - Apply Function to Elements in Parallel using Futures
Stars: ✭ 159 (+1035.71%)
yaskYASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-difference methods and similar applications.
Stars: ✭ 81 (+478.57%)
PyMFEMPython wrapper for MFEM
Stars: ✭ 91 (+550%)
pcluster-managerManage AWS ParallelCluster through an easy to use web interface
Stars: ✭ 67 (+378.57%)
hpcLearning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )
Stars: ✭ 39 (+178.57%)
claw-compilerCLAW Compiler for Performance Portability
Stars: ✭ 38 (+171.43%)
t8codeParallel algorithms and data structures for tree-based AMR with arbitrary element shapes.
Stars: ✭ 37 (+164.29%)
matrix multiplicationParallel Matrix Multiplication Using OpenMP, Phtreads, and MPI
Stars: ✭ 41 (+192.86%)
libquoDynamic execution environments for coupled, thread-heterogeneous MPI+X applications
Stars: ✭ 21 (+50%)
bitpitOpen source library for scientific HPC
Stars: ✭ 80 (+471.43%)
hero-sdk⛔ DEPRECATED ⛔ HERO Software Development Kit
Stars: ✭ 21 (+50%)
dbcsrDBCSR: Distributed Block Compressed Sparse Row matrix library
Stars: ✭ 65 (+364.29%)
euler2d kokkosSimple 2d finite volume solver for Euler equations using c++ kokkos library
Stars: ✭ 27 (+92.86%)
HiSpatialClusterClustering spatial points with algorithm of Fast Search, high performace computing implements of CUDA or parallel in CPU, and runnable implements on python standalone or arcgis.
Stars: ✭ 31 (+121.43%)
wxparaverwxParaver is a trace-based visualization and analysis tool designed to study quantitative detailed metrics and obtain qualitative knowledge of the performance of applications, libraries, processors and whole architectures.
Stars: ✭ 23 (+64.29%)
NbodyN body gravity attraction problem solver
Stars: ✭ 40 (+185.71%)
hp2pHeavy Peer To Peer: a MPI based benchmark for network diagnostic
Stars: ✭ 17 (+21.43%)
HeCBenchsoftware.intel.com/content/www/us/en/develop/articles/repo-evaluating-performance-productivity-oneapi.html
Stars: ✭ 85 (+507.14%)
FGPUNo description or website provided.
Stars: ✭ 30 (+114.29%)
frameworkThe Arcane Framework for HPC codes
Stars: ✭ 15 (+7.14%)