HipsyclImplementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+138.61%)
HeteroflowConcurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-63.92%)
rbcudaCUDA bindings for Ruby
Stars: ✭ 57 (-63.92%)
Stdgpustdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+236.08%)
BayaderaHigh-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (+116.46%)
DeepnetDeep.Net machine learning framework for F#
Stars: ✭ 99 (-37.34%)
GinkgoNumerical linear algebra software package
Stars: ✭ 149 (-5.7%)
Nvidia libs testTests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-77.22%)
BohriumAutomatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Stars: ✭ 209 (+32.28%)
gpuvmemGPU Framework for Radio Astronomical Image Synthesis
Stars: ✭ 27 (-82.91%)
HeCBenchsoftware.intel.com/content/www/us/en/develop/articles/repo-evaluating-performance-productivity-oneapi.html
Stars: ✭ 85 (-46.2%)
ObsidianObsidian Language Repository
Stars: ✭ 38 (-75.95%)
MatXAn efficient C++17 GPU numerical computing library with Python-like syntax
Stars: ✭ 418 (+164.56%)
ClojureclClojureCL is a Clojure library for parallel computations with OpenCL.
Stars: ✭ 266 (+68.35%)
TutorialsSome basic programming tutorials
Stars: ✭ 353 (+123.42%)
JampackExperimental parallel compression algorithm
Stars: ✭ 21 (-86.71%)
Tf Quant FinanceHigh-performance TensorFlow library for quantitative finance.
Stars: ✭ 2,925 (+1751.27%)
Montecarlomeasurements.jlPropagation of distributions by Monte-Carlo sampling: Real number types with uncertainty represented by samples.
Stars: ✭ 168 (+6.33%)
gpuhdMassively Parallel Huffman Decoding on GPUs
Stars: ✭ 30 (-81.01%)
runtimeAnyDSL Runtime Library
Stars: ✭ 17 (-89.24%)
LuisaRenderHigh-Performance Multiple-Backend Renderer Based on LuisaCompute
Stars: ✭ 47 (-70.25%)
VuhVulkan compute for people
Stars: ✭ 264 (+67.09%)
GpurirPython library for Room Impulse Response (RIR) simulation with GPU acceleration
Stars: ✭ 145 (-8.23%)
Cuda Api WrappersThin C++-flavored wrappers for the CUDA Runtime API
Stars: ✭ 362 (+129.11%)
cuda memtestFork of CUDA GPU memtest 👓
Stars: ✭ 68 (-56.96%)
Neuralnetwork.netA TensorFlow-inspired neural network library built from scratch in C# 7.3 for .NET Standard 2.0, with GPU support through cuDNN
Stars: ✭ 392 (+148.1%)
LuxcoreLuxCore source repository
Stars: ✭ 601 (+280.38%)
PysnnEfficient Spiking Neural Network framework, built on top of PyTorch for GPU acceleration
Stars: ✭ 129 (-18.35%)
Marian DevFast Neural Machine Translation in C++ - development repository
Stars: ✭ 136 (-13.92%)
NeanderthalFast Clojure Matrix Library
Stars: ✭ 927 (+486.71%)
SixtyfourHow fast can we brute force a 64-bit comparison?
Stars: ✭ 41 (-74.05%)
CARECHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as loop fusion capability and a portable interface for many numerical algorithms. It provides all the basics for anyone wanting to write portable code.
Stars: ✭ 22 (-86.08%)
PycudaCUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+603.8%)
Autodock GpuAutoDock for GPUs and other accelerators
Stars: ✭ 65 (-58.86%)
GpufitGPU-accelerated Levenberg-Marquardt curve fitting in CUDA
Stars: ✭ 174 (+10.13%)
ArraymancerA fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+401.9%)
GOSHAn ultra-fast, GPU-based large graph embedding algorithm utilizing a novel coarsening algorithm requiring not more than a single GPU.
Stars: ✭ 12 (-92.41%)
AccelerateEmbedded language for high-performance array computations
Stars: ✭ 751 (+375.32%)
CekirdeklerMulti-device OpenCL kernel load balancer and pipeliner API for C#. Uses shared-distributed memory model to keep GPUs updated fast while using same kernel on all devices(for simplicity).
Stars: ✭ 76 (-51.9%)
EmuThe write-once-run-anywhere GPGPU library for Rust
Stars: ✭ 1,350 (+754.43%)
NsimdAgenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (-12.66%)
FastflowFastFlow pattern-based parallel programming framework (formerly on sourceforge)
Stars: ✭ 137 (-13.29%)
SpanetSpatial Attentive Single-Image Deraining with a High Quality Real Rain Dataset (CVPR'19)
Stars: ✭ 136 (-13.92%)
Go Sessions🔐 The sessions manager for the Go Programming Language. Supports both net/http and fasthttp.
Stars: ✭ 134 (-15.19%)
Cuda CnnCNN accelerated by cuda. Test on mnist and finilly get 99.76%
Stars: ✭ 148 (-6.33%)
Partial Order PruningPartial Order Pruning: for Best Speed/Accuracy Trade-off in Neural Architecture Search
Stars: ✭ 135 (-14.56%)
ClvkExperimental implementation of OpenCL on Vulkan
Stars: ✭ 158 (+0%)
RmmRAPIDS Memory Manager
Stars: ✭ 154 (-2.53%)
StitchemVahana VR & VideoStitch Studio: software to create immersive 360° VR video, live and in post-production
Stars: ✭ 147 (-6.96%)
LindbLinDB is a scalable, high performance, high availability distributed time series database.
Stars: ✭ 2,105 (+1232.28%)
SketchgraphsA dataset of 15 million CAD sketches with geometric constraint graphs.
Stars: ✭ 148 (-6.33%)
Hedgehog LabRun, compile and execute JavaScript for Scientific Computing and Data Visualization TOTALLY TOTALLY TOTALLY in your BROWSER! An open source scientific computing environment for JavaScript TOTALLY in your browser, matrix operations with GPU acceleration, TeX support, data visualization and symbolic computation.
Stars: ✭ 1,797 (+1037.34%)
NnvmNo description or website provided.
Stars: ✭ 1,639 (+937.34%)
DlafDiffusion-limited aggregation, fast.
Stars: ✭ 156 (-1.27%)
LibcudacxxThe C++ Standard Library for your entire system.
Stars: ✭ 1,861 (+1077.85%)