tinker9Tinker9: Next Generation of Tinker with GPU Support
Stars: ✭ 31 (-48.33%)
RaytrAMPShooting and bouncing rays method for radar cross-section calculations, accelerated with BVH algorithm running on GPU (C++ AMP).
Stars: ✭ 49 (-18.33%)
BlendluxcoreBlender Integration for LuxCore
Stars: ✭ 287 (+378.33%)
HeCBenchsoftware.intel.com/content/www/us/en/develop/articles/repo-evaluating-performance-productivity-oneapi.html
Stars: ✭ 85 (+41.67%)
gardeniaGARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators
Stars: ✭ 22 (-63.33%)
Vulkan KomputeGeneral purpose GPU compute framework for cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing usecases.
Stars: ✭ 350 (+483.33%)
GooFitCode repository for the massively-parallel framework for maximum-likelihood fits, implemented in CUDA/OpenMP
Stars: ✭ 112 (+86.67%)
Kubernetes Gpu GuideThis guide should help fellow researchers and hobbyists to easily automate and accelerate there deep leaning training with their own Kubernetes GPU cluster.
Stars: ✭ 740 (+1133.33%)
beatmupBeatmup: image and signal processing library
Stars: ✭ 168 (+180%)
XLearning-GPUqihoo360 xlearning with GPU support; AI on Hadoop
Stars: ✭ 22 (-63.33%)
GOSHAn ultra-fast, GPU-based large graph embedding algorithm utilizing a novel coarsening algorithm requiring not more than a single GPU.
Stars: ✭ 12 (-80%)
hipercHigh Performance Computing Strategies for Boundary Value Problems
Stars: ✭ 36 (-40%)
TutorialsSome basic programming tutorials
Stars: ✭ 353 (+488.33%)
PySDMPythonic particle-based (super-droplet) warm-rain/aqueous-chemistry cloud microphysics package with box, parcel & 1D/2D prescribed-flow examples in Python, Julia and Matlab
Stars: ✭ 26 (-56.67%)
ArraymancerA fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+1221.67%)
EtalerA flexable HTM (Hierarchical Temporal Memory) framework with full GPU support.
Stars: ✭ 79 (+31.67%)
WebclglGPGPU Javascript library 🐸
Stars: ✭ 313 (+421.67%)
Nvidia libs testTests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-40%)
qmcA Quasi-Monte-Carlo Integrator Library with CUDA Support
Stars: ✭ 17 (-71.67%)
VuhVulkan compute for people
Stars: ✭ 264 (+340%)
learn-gpgpuAlgorithms implemented in CUDA + resources about GPGPU
Stars: ✭ 37 (-38.33%)
Stdgpustdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+785%)
CARECHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as loop fusion capability and a portable interface for many numerical algorithms. It provides all the basics for anyone wanting to write portable code.
Stars: ✭ 22 (-63.33%)
cuda memtestFork of CUDA GPU memtest 👓
Stars: ✭ 68 (+13.33%)
CUDAfy.NETCUDAfy .NET allows easy development of high performance GPGPU applications completely from the .NET. It's developed in C#.
Stars: ✭ 56 (-6.67%)
Cuda Api WrappersThin C++-flavored wrappers for the CUDA Runtime API
Stars: ✭ 362 (+503.33%)
ObsidianObsidian Language Repository
Stars: ✭ 38 (-36.67%)
NeanderthalFast Clojure Matrix Library
Stars: ✭ 927 (+1445%)
OpenPHParallel reduction of boundary matrices for Persistent Homology with CUDA
Stars: ✭ 14 (-76.67%)
Trisycl Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group
Stars: ✭ 354 (+490%)
komputeGeneral purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing usecases. Backed by the Linux Foundation.
Stars: ✭ 872 (+1353.33%)
Fractional differencing gpuRapid large-scale fractional differencing with RAPIDS to minimize memory loss while making a time series stationary. 6x-400x speed up over CPU implementation.
Stars: ✭ 38 (-36.67%)
taichi ptprogressive path tracer written in taichi
Stars: ✭ 20 (-66.67%)
BayaderaHigh-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (+470%)
AccelerateEmbedded language for high-performance array computations
Stars: ✭ 751 (+1151.67%)
aer-engine♒ An OpenGL 4.3 / C++ 11 rendering engine oriented towards animation.
Stars: ✭ 26 (-56.67%)
dlprimitivesDeep Learning Primitives and Mini-Framework for OpenCL
Stars: ✭ 65 (+8.33%)
Raspberrypi tempmonRaspberry pi CPU temperature monitor with many functions such as logging, GPIO output, graphing, email, alarm, notifications and stress testing. Python 3.
Stars: ✭ 52 (-13.33%)
gpuhdMassively Parallel Huffman Decoding on GPUs
Stars: ✭ 30 (-50%)
ClojureclClojureCL is a Clojure library for parallel computations with OpenCL.
Stars: ✭ 266 (+343.33%)
runtimeAnyDSL Runtime Library
Stars: ✭ 17 (-71.67%)
LuxcoreLuxCore source repository
Stars: ✭ 601 (+901.67%)
euler2d cudaFortran2nd order Godunov solver for 2d Euler equations written in CUDA Fortran and stdpar (standard paralelism)
Stars: ✭ 24 (-60%)
MatXAn efficient C++17 GPU numerical computing library with Python-like syntax
Stars: ✭ 418 (+596.67%)
1833718.337 - Parallel Computing and Scientific Machine Learning
Stars: ✭ 834 (+1290%)
PetIBMPetIBM - toolbox and applications of the immersed-boundary method on distributed-memory architectures
Stars: ✭ 80 (+33.33%)
articThe AlteRnaTive Impala Compiler
Stars: ✭ 16 (-73.33%)
LvArrayPortable HPC Containers (C++)
Stars: ✭ 37 (-38.33%)
PicongpuParticle-in-Cell Simulations for the Exascale Era ✨
Stars: ✭ 452 (+653.33%)
PyMFEMPython wrapper for MFEM
Stars: ✭ 91 (+51.67%)
HeteroflowConcurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-5%)
SixtyfourHow fast can we brute force a 64-bit comparison?
Stars: ✭ 41 (-31.67%)
BindsnetSimulation of spiking neural networks (SNNs) using PyTorch.
Stars: ✭ 837 (+1295%)
HipsyclImplementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+528.33%)
rbcudaCUDA bindings for Ruby
Stars: ✭ 57 (-5%)