Pyopencl: OpenCL integration for Python, plus shiny features
Stars: ✭ 790 (-28.96%)
Loopy: A code generator for array-based code on CPUs and GPUs
Stars: ✭ 367 (-67%)
Heteroflow: Concurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-94.87%)
MatX: An efficient C++17 GPU numerical computing library with Python-like syntax
Stars: ✭ 418 (-62.41%)
Bayadera: High-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (-69.24%)
ArrayFire: a general purpose GPU library.
Stars: ✭ 3,693 (+232.1%)
Neanderthal: Fast Clojure Matrix Library
Stars: ✭ 927 (-16.64%)
Deepnet: Deep.Net machine learning framework for F#
Stars: ✭ 99 (-91.1%)
Hipsycl: Implementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (-66.1%)
stdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (-52.25%)
Cuda Api Wrappers: Thin C++-flavored wrappers for the CUDA Runtime API
Stars: ✭ 362 (-67.45%)
monolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
Stars: ✭ 166 (-85.07%)
Arraymancer: A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (-28.69%)
HeCBench: software.intel.com/content/www/us/en/develop/articles/repo-evaluating-performance-productivity-oneapi.html
Stars: ✭ 85 (-92.36%)
cuda memtest: Fork of CUDA GPU memtest 👓
Stars: ✭ 68 (-93.88%)
Nvidia libs test: Tests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-96.76%)
Komputation: Komputation is a neural network framework for the Java Virtual Machine written in Kotlin and CUDA C.
Stars: ✭ 295 (-73.47%)
Carlsim3: CARLsim is an efficient, easy-to-use, GPU-accelerated software framework for simulating large-scale spiking neural network (SNN) models with a high degree of biological detail.
Stars: ✭ 52 (-95.32%)
Massiv: Efficient Haskell Arrays featuring Parallel computation
Stars: ✭ 328 (-70.5%)
Gprmax: gprMax is open source software that simulates electromagnetic wave propagation using the Finite-Difference Time-Domain (FDTD) method for numerical modelling of Ground Penetrating Radar (GPR)
Stars: ✭ 268 (-75.9%)
Fast gicp: A collection of GICP-based fast point cloud registration algorithms
Stars: ✭ 307 (-72.39%)
Arrayy: 🗃 Array manipulation library for PHP, called Arrayy!
Stars: ✭ 363 (-67.36%)
Cuda.jl: CUDA programming in Julia.
Stars: ✭ 370 (-66.73%)
Ilgpu: ILGPU JIT Compiler for high-performance .Net GPU programs
Stars: ✭ 374 (-66.37%)
Deep Diamond: A fast Clojure Tensor & Deep Learning library
Stars: ✭ 288 (-74.1%)
Awesome Cuda: This is a list of useful libraries and resources for CUDA development.
Stars: ✭ 274 (-75.36%)
Webclgl: GPGPU Javascript library 🐸
Stars: ✭ 313 (-71.85%)
Hemi: Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.
Stars: ✭ 275 (-75.27%)
3: GPU-accelerated micromagnetic simulator
Stars: ✭ 324 (-70.86%)
Thrust: The C++ parallel algorithms library.
Stars: ✭ 3,595 (+223.29%)
H2o4gpu: H2Oai GPU Edition
Stars: ✭ 416 (-62.59%)
Tutorials: Some basic programming tutorials
Stars: ✭ 353 (-68.26%)
Picongpu: Particle-in-Cell Simulations for the Exascale Era ✨
Stars: ✭ 452 (-59.35%)
Open3D: A Modern Library for 3D Data Processing
Stars: ✭ 5,860 (+426.98%)
Caer: High-performance Vision library in Python. Scale your research, not boilerplate.
Stars: ✭ 452 (-59.35%)
Arrayfire Python: Python bindings for ArrayFire: A general purpose GPU library.
Stars: ✭ 358 (-67.81%)
Amgcl: C++ library for solving large sparse linear systems with algebraic multigrid method
Stars: ✭ 390 (-64.93%)
Cudf: cuDF - GPU DataFrame Library
Stars: ✭ 4,370 (+292.99%)
Vuh: Vulkan compute for people
Stars: ✭ 264 (-76.26%)
Cupy: NumPy & SciPy for GPU
Stars: ✭ 5,625 (+405.85%)
Lighthouse2: Lighthouse 2 framework for real-time ray tracing
Stars: ✭ 542 (-51.26%)
Cudasift: A CUDA implementation of SIFT for NVidia GPUs (1.2 ms on a GTX 1060)
Stars: ✭ 555 (-50.09%)
Luxcore: LuxCore source repository
Stars: ✭ 601 (-45.95%)
ThunderGBM: Fast GBDTs and Random Forests on GPUs
Stars: ✭ 586 (-47.3%)
Chainer: A flexible framework of neural networks for deep learning
Stars: ✭ 5,656 (+408.63%)
Vexcl: VexCL is a C++ vector expression template library for OpenCL/CUDA/OpenMP
Stars: ✭ 626 (-43.71%)
Qualia2.0: Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Stars: ✭ 41 (-96.31%)
Sixtyfour: How fast can we brute force a 64-bit comparison?
Stars: ✭ 41 (-96.31%)
Rustacuda: Rusty wrapper for the CUDA Driver API
Stars: ✭ 511 (-54.05%)
Speedtorch: Library for faster pinned CPU <-> GPU transfer in Pytorch
Stars: ✭ 615 (-44.69%)
Gunrock: High-Performance Graph Primitives on GPUs
Stars: ✭ 718 (-35.43%)
Kubernetes Gpu Guide: This guide should help fellow researchers and hobbyists to easily automate and accelerate their deep learning training with their own Kubernetes GPU cluster.
Stars: ✭ 740 (-33.45%)
Scikit Cuda: Python interface to GPU-powered libraries
Stars: ✭ 803 (-27.79%)
Marian: Fast Neural Machine Translation in C++
Stars: ✭ 777 (-30.13%)
Wheels: Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (-19.87%)
GraphVite: A General and High-performance Graph Embedding System
Stars: ✭ 865 (-22.21%)
Blitz: Blitz++ Multi-Dimensional Array Library for C++
Stars: ✭ 257 (-76.89%)