HeteroflowConcurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (+50%)
CARECHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as loop fusion capability and a portable interface for many numerical algorithms. It provides all the basics for anyone wanting to write portable code.
Stars: ✭ 22 (-42.11%)
DeepnetDeep.Net machine learning framework for F#
Stars: ✭ 99 (+160.53%)
gpuvmemGPU Framework for Radio Astronomical Image Synthesis
Stars: ✭ 27 (-28.95%)
runtimeAnyDSL Runtime Library
Stars: ✭ 17 (-55.26%)
gpuhdMassively Parallel Huffman Decoding on GPUs
Stars: ✭ 30 (-21.05%)
CekirdeklerMulti-device OpenCL kernel load balancer and pipeliner API for C#. Uses shared-distributed memory model to keep GPUs updated fast while using same kernel on all devices(for simplicity).
Stars: ✭ 76 (+100%)
Montecarlomeasurements.jlPropagation of distributions by Monte-Carlo sampling: Real number types with uncertainty represented by samples.
Stars: ✭ 168 (+342.11%)
BayaderaHigh-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (+800%)
rbcudaCUDA bindings for Ruby
Stars: ✭ 57 (+50%)
Stdgpustdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+1297.37%)
ClojurecudaClojure library for CUDA development
Stars: ✭ 158 (+315.79%)
GpufitGPU-accelerated Levenberg-Marquardt curve fitting in CUDA
Stars: ✭ 174 (+357.89%)
VuhVulkan compute for people
Stars: ✭ 264 (+594.74%)
EmuThe write-once-run-anywhere GPGPU library for Rust
Stars: ✭ 1,350 (+3452.63%)
PysnnEfficient Spiking Neural Network framework, built on top of PyTorch for GPU acceleration
Stars: ✭ 129 (+239.47%)
andromedaGLSL-targetting embedded compiler, and OpenGL rendering engine.
Stars: ✭ 75 (+97.37%)
PetIBMPetIBM - toolbox and applications of the immersed-boundary method on distributed-memory architectures
Stars: ✭ 80 (+110.53%)
taichi ptprogressive path tracer written in taichi
Stars: ✭ 20 (-47.37%)
dlprimitivesDeep Learning Primitives and Mini-Framework for OpenCL
Stars: ✭ 65 (+71.05%)
LvArrayPortable HPC Containers (C++)
Stars: ✭ 37 (-2.63%)
pytodTOD: GPU-accelerated Outlier Detection via Tensor Operations
Stars: ✭ 131 (+244.74%)
Nexus🖼️ Actionscript 3, GPU accelerated 2D game engine using Stage3D
Stars: ✭ 12 (-68.42%)
QUICKQUICK: A GPU-enabled ab intio quantum chemistry software package
Stars: ✭ 79 (+107.89%)
GoldenSunA path tracer based on hardware ray tracing
Stars: ✭ 20 (-47.37%)
learn-gpgpuAlgorithms implemented in CUDA + resources about GPGPU
Stars: ✭ 37 (-2.63%)
tinker9Tinker9: Next Generation of Tinker with GPU Support
Stars: ✭ 31 (-18.42%)
gardeniaGARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators
Stars: ✭ 22 (-42.11%)
genspioGenerate Shell Phrases In OCaml
Stars: ✭ 46 (+21.05%)
gpuowlGPU Mersenne primality test.
Stars: ✭ 77 (+102.63%)
OpenPHParallel reduction of boundary matrices for Persistent Homology with CUDA
Stars: ✭ 14 (-63.16%)
hipercHigh Performance Computing Strategies for Boundary Value Problems
Stars: ✭ 36 (-5.26%)
KRSThe Kria Robotics Stack (KRS) is a ROS 2 superset for industry, an integrated set of robot libraries and utilities to accelerate the development, maintenance and commercialization of industrial-grade robotic solutions while using adaptive computing.
Stars: ✭ 26 (-31.58%)
CUDAfy.NETCUDAfy .NET allows easy development of high performance GPGPU applications completely from the .NET. It's developed in C#.
Stars: ✭ 56 (+47.37%)
environmentsDetermined AI public environments
Stars: ✭ 22 (-42.11%)
CrossbowCrossbow: A Multi-GPU Deep Learning System for Training with Small Batch Sizes
Stars: ✭ 52 (+36.84%)
Jamais-VuAudio Fingerprinting and Recognition in Python using NVidia's CUDA
Stars: ✭ 24 (-36.84%)
pytorch-gpu-data-science-projectTemplate repository for a Python 3-based (data) science project with GPU acceleration using the PyTorch ecosystem.
Stars: ✭ 16 (-57.89%)
dpnpNumPy drop-in replacement for Intel(R) XPUs
Stars: ✭ 42 (+10.53%)
EtalerA flexable HTM (Hierarchical Temporal Memory) framework with full GPU support.
Stars: ✭ 79 (+107.89%)
RaytrAMPShooting and bouncing rays method for radar cross-section calculations, accelerated with BVH algorithm running on GPU (C++ AMP).
Stars: ✭ 49 (+28.95%)
brian2cudaA brian2 extension to simulate spiking neural networks on GPUs
Stars: ✭ 46 (+21.05%)
cef-mixerHigh Performance off-screen rendering (OSR) demo using CEF
Stars: ✭ 183 (+381.58%)
notebooksA docker-based starter kit for machine learning via jupyter notebooks. Designed for those who just want a runtime environment and get on with machine learning. Docker tags:
Stars: ✭ 29 (-23.68%)
Guided Missile SimulationGuided Missile, Radar and Infrared EOS Simulation Framework written in Fortran.
Stars: ✭ 33 (-13.16%)
HashedExpressionType-safe modelling DSL, symbolic transformation, and code generation for solving optimization problems.
Stars: ✭ 40 (+5.26%)
opensbliA framework for the automated derivation and parallel execution of finite difference solvers on a range of computer architectures.
Stars: ✭ 56 (+47.37%)
TfjsA WebGL accelerated JavaScript library for training and deploying ML models.
Stars: ✭ 15,834 (+41568.42%)
qmcA Quasi-Monte-Carlo Integrator Library with CUDA Support
Stars: ✭ 17 (-55.26%)
DreamplaceDeep learning toolkit-enabled VLSI placement
Stars: ✭ 211 (+455.26%)
PySDMPythonic particle-based (super-droplet) warm-rain/aqueous-chemistry cloud microphysics package with box, parcel & 1D/2D prescribed-flow examples in Python, Julia and Matlab
Stars: ✭ 26 (-31.58%)
gpu-cluster-configHow to Configure a GPU Cluster Running Ubuntu Linux
Stars: ✭ 45 (+18.42%)
HyperformulaA complete, open-source Excel-like calculation engine written in TypeScript. Includes 380+ built-in functions. Maintained by the Handsontable team⚡
Stars: ✭ 210 (+452.63%)
BohriumAutomatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Stars: ✭ 209 (+450%)
PHCpackThe primary source code repository for PHCpack, a software package to solve polynomial systems with homotopy continuation methods.
Stars: ✭ 50 (+31.58%)
GpytorchA highly efficient and modular implementation of Gaussian Processes in PyTorch
Stars: ✭ 2,622 (+6800%)