Picongpu: Particle-in-Cell Simulations for the Exascale Era ✨
Stars: ✭ 452 (+90.72%)
Autodock Gpu: AutoDock for GPUs and other accelerators
Stars: ✭ 65 (-72.57%)
Accelerate: Embedded language for high-performance array computations
Stars: ✭ 751 (+216.88%)
PyMFEM: Python wrapper for MFEM
Stars: ✭ 91 (-61.6%)
Emu: The write-once-run-anywhere GPGPU library for Rust
Stars: ✭ 1,350 (+469.62%)
Trisycl: Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group
Stars: ✭ 354 (+49.37%)
Clvk: Experimental implementation of OpenCL on Vulkan
Stars: ✭ 158 (-33.33%)
MatX: An efficient C++17 GPU numerical computing library with Python-like syntax
Stars: ✭ 418 (+76.37%)
Raspberrypi tempmon: Raspberry Pi CPU temperature monitor with many functions such as logging, GPIO output, graphing, email, alarm, notifications and stress testing. Python 3.
Stars: ✭ 52 (-78.06%)
Neanderthal: Fast Clojure Matrix Library
Stars: ✭ 927 (+291.14%)
Obsidian: Obsidian Language Repository
Stars: ✭ 38 (-83.97%)
Openclga: A Python Library for Genetic Algorithm on OpenCL
Stars: ✭ 103 (-56.54%)
Luxcore: LuxCore source repository
Stars: ✭ 601 (+153.59%)
Pelemay: Pelemay is a native compiler for Elixir that generates SIMD instructions. GPU code generation is planned.
Stars: ✭ 161 (-32.07%)
Cuda Api Wrappers: Thin C++-flavored wrappers for the CUDA Runtime API
Stars: ✭ 362 (+52.74%)
Bayadera: High-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (+44.3%)
Fast: A framework for GPU-based high-performance medical image processing and visualization
Stars: ✭ 179 (-24.47%)
Clojurecl: ClojureCL is a Clojure library for parallel computations with OpenCL.
Stars: ✭ 266 (+12.24%)
Sushi2: Matrix Library for JavaScript
Stars: ✭ 60 (-74.68%)
artic: The AlteRnaTive Impala Compiler
Stars: ✭ 16 (-93.25%)
Fastflow: FastFlow pattern-based parallel programming framework (formerly on SourceForge)
Stars: ✭ 137 (-42.19%)
Fractional differencing gpu: Rapid large-scale fractional differencing with RAPIDS to minimize memory loss while making a time series stationary. 6x-400x speedup over the CPU implementation.
Stars: ✭ 38 (-83.97%)
Bindsnet: Simulation of spiking neural networks (SNNs) using PyTorch.
Stars: ✭ 837 (+253.16%)
HeCBench: software.intel.com/content/www/us/en/develop/articles/repo-evaluating-performance-productivity-oneapi.html
Stars: ✭ 85 (-64.14%)
Pysnn: Efficient Spiking Neural Network framework, built on top of PyTorch for GPU acceleration
Stars: ✭ 129 (-45.57%)
Arraymancer: A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, CUDA and OpenCL backends
Stars: ✭ 793 (+234.6%)
Montecarlomeasurements.jl: Propagation of distributions by Monte-Carlo sampling: Real number types with uncertainty represented by samples.
Stars: ✭ 168 (-29.11%)
Kubernetes Gpu Guide: This guide should help fellow researchers and hobbyists easily automate and accelerate their deep learning training with their own Kubernetes GPU cluster.
Stars: ✭ 740 (+212.24%)
Deepnet: Deep.Net machine learning framework for F#
Stars: ✭ 99 (-58.23%)
Stdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+124.05%)
Awesome Webgpu: 😎 Curated list of awesome things around the WebGPU ecosystem.
Stars: ✭ 182 (-23.21%)
Hipsycl: Implementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+59.07%)
Nyuziprocessor: GPGPU microprocessor architecture
Stars: ✭ 1,351 (+470.04%)
Tutorials: Some basic programming tutorials
Stars: ✭ 353 (+48.95%)
Clojurecuda: Clojure library for CUDA development
Stars: ✭ 158 (-33.33%)
Vulkan Kompute: General purpose GPU compute framework for cross-vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing use cases.
Stars: ✭ 350 (+47.68%)
Cekirdekler: Multi-device OpenCL kernel load balancer and pipeliner API for C#. Uses a shared-distributed memory model to keep GPUs updated quickly while using the same kernel on all devices (for simplicity).
Stars: ✭ 76 (-67.93%)
Webclgl: GPGPU JavaScript library 🐸
Stars: ✭ 313 (+32.07%)
Gpur: R interface to use GPUs
Stars: ✭ 208 (-12.24%)
Blendluxcore: Blender Integration for LuxCore
Stars: ✭ 287 (+21.1%)
Pycuda: CUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+369.2%)
Vuh: Vulkan compute for people
Stars: ✭ 264 (+11.39%)
Ginkgo: Numerical linear algebra software package
Stars: ✭ 149 (-37.13%)
XLearning-GPU: qihoo360 xlearning with GPU support; AI on Hadoop
Stars: ✭ 22 (-90.72%)
Heteroflow: Concurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-75.95%)
cuda memtest: Fork of CUDA GPU memtest 👓
Stars: ✭ 68 (-71.31%)
Gpufit: GPU-accelerated Levenberg-Marquardt curve fitting in CUDA
Stars: ✭ 174 (-26.58%)
rbcuda: CUDA bindings for Ruby
Stars: ✭ 57 (-75.95%)
Sixtyfour: How fast can we brute force a 64-bit comparison?
Stars: ✭ 41 (-82.7%)
GOSH: An ultra-fast, GPU-based large graph embedding algorithm utilizing a novel coarsening algorithm requiring not more than a single GPU.
Stars: ✭ 12 (-94.94%)
Nvidia libs test: Tests and benchmarks for cuDNN (and in the future, other NVIDIA libraries)
Stars: ✭ 36 (-84.81%)
Tf Quant Finance: High-performance TensorFlow library for quantitative finance.
Stars: ✭ 2,925 (+1134.18%)
Lingvo: Lingvo
Stars: ✭ 2,361 (+896.2%)
Pai: Resource scheduling and cluster management for AI
Stars: ✭ 2,223 (+837.97%)
18337: 18.337 - Parallel Computing and Scientific Machine Learning
Stars: ✭ 834 (+251.9%)