HeCBenchsoftware.intel.com/content/www/us/en/develop/articles/repo-evaluating-performance-productivity-oneapi.html
OpenPHParallel reduction of boundary matrices for Persistent Homology with CUDA
PySDMPythonic particle-based (super-droplet) warm-rain/aqueous-chemistry cloud microphysics package with box, parcel & 1D/2D prescribed-flow examples in Python, Julia and Matlab
komputeGeneral purpose GPU compute framework built on Vulkan to support thousands of cross-vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing use cases. Backed by the Linux Foundation.
tinker9Tinker9: Next Generation of Tinker with GPU Support
taichi ptprogressive path tracer written in taichi
EtalerA flexible HTM (Hierarchical Temporal Memory) framework with full GPU support.
GooFitCode repository for the massively-parallel framework for maximum-likelihood fits, implemented in CUDA/OpenMP
aer-engine♒ An OpenGL 4.3 / C++ 11 rendering engine oriented towards animation.
dlprimitivesDeep Learning Primitives and Mini-Framework for OpenCL
RaytrAMPShooting and bouncing rays method for radar cross-section calculations, accelerated with BVH algorithm running on GPU (C++ AMP).
gpuhdMassively Parallel Huffman Decoding on GPUs
qmcA Quasi-Monte-Carlo Integrator Library with CUDA Support
beatmupBeatmup: image and signal processing library
euler2d cudaFortran2nd order Godunov solver for 2D Euler equations written in CUDA Fortran and stdpar (standard parallelism)
learn-gpgpuAlgorithms implemented in CUDA + resources about GPGPU
gardeniaGARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators
PetIBMPetIBM - toolbox and applications of the immersed-boundary method on distributed-memory architectures
CARECHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as loop fusion capability and a portable interface for many numerical algorithms. It provides all the basics for anyone wanting to write portable code.
LvArrayPortable HPC Containers (C++)
hipercHigh Performance Computing Strategies for Boundary Value Problems
CUDAfy.NETCUDAfy .NET allows easy development of high-performance GPGPU applications entirely from .NET. It is developed in C#.
gpuowlGPU Mersenne primality test.
gpuvmemGPU Framework for Radio Astronomical Image Synthesis
notebooksA Docker-based starter kit for machine learning via Jupyter notebooks. Designed for those who just want a runtime environment and want to get on with machine learning. Docker tags:
opensbliA framework for the automated derivation and parallel execution of finite difference solvers on a range of computer architectures.