GpurR interface to use GPU's
Awesome Webgpu😎 Curated list of awesome things around WebGPU ecosystem.
FastA framework for GPU based high-performance medical image processing and visualization
GpufitGPU-accelerated Levenberg-Marquardt curve fitting in CUDA
Montecarlomeasurements.jlPropagation of distributions by Monte-Carlo sampling: Real number types with uncertainty represented by samples.
PelemayPelemay is a native compiler for Elixir, which generates SIMD instructions. It has a plan to generate for GPU code.
ClvkExperimental implementation of OpenCL on Vulkan
GinkgoNumerical linear algebra software package
FastflowFastFlow pattern-based parallel programming framework (formerly on sourceforge)
PysnnEfficient Spiking Neural Network framework, built on top of PyTorch for GPU acceleration
OpenclgaA Python Library for Genetic Algorithm on OpenCL
DeepnetDeep.Net machine learning framework for F#
EmuThe write-once-run-anywhere GPGPU library for Rust
CekirdeklerMulti-device OpenCL kernel load balancer and pipeliner API for C#. Uses shared-distributed memory model to keep GPUs updated fast while using same kernel on all devices(for simplicity).
PycudaCUDA integration for Python, plus shiny features
Sushi2Matrix Library for JavaScript
HeteroflowConcurrent CPU-GPU Programming using Task Models
Raspberrypi tempmonRaspberry pi CPU temperature monitor with many functions such as logging, GPIO output, graphing, email, alarm, notifications and stress testing. Python 3.
SixtyfourHow fast can we brute force a 64-bit comparison?
Fractional differencing gpuRapid large-scale fractional differencing with RAPIDS to minimize memory loss while making a time series stationary. 6x-400x speed up over CPU implementation.
Nvidia libs testTests and benchmarks for cudnn (and in the future, other nvidia libraries)
1833718.337 - Parallel Computing and Scientific Machine Learning
BindsnetSimulation of spiking neural networks (SNNs) using PyTorch.
ArraymancerA fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
AccelerateEmbedded language for high-performance array computations
Kubernetes Gpu GuideThis guide should help fellow researchers and hobbyists to easily automate and accelerate there deep leaning training with their own Kubernetes GPU cluster.
Stdgpustdgpu: Efficient STL-like Data Structures on the GPU
PicongpuParticle-in-Cell Simulations for the Exascale Era ✨
HipsyclImplementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Trisycl Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group
Vulkan KomputeGeneral purpose GPU compute framework for cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing usecases.
BayaderaHigh-performance Bayesian Data Analysis on the GPU in Clojure
ClojureclClojureCL is a Clojure library for parallel computations with OpenCL.
VuhVulkan compute for people
PaiResource scheduling and cluster management for AI
MatXAn efficient C++17 GPU numerical computing library with Python-like syntax
articThe AlteRnaTive Impala Compiler
GOSHAn ultra-fast, GPU-based large graph embedding algorithm utilizing a novel coarsening algorithm requiring not more than a single GPU.