ArraymancerA fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+1120%)
LuxcoreLuxCore source repository
Stars: ✭ 601 (+824.62%)
HipsyclImplementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+480%)
BayaderaHigh-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (+426.15%)
NeanderthalFast Clojure Matrix Library
Stars: ✭ 927 (+1326.15%)
FloorA C++ Compute/Graphics Library and Toolchain enabling same-source CUDA/Host/Metal/OpenCL/Vulkan C++ programming and execution.
Stars: ✭ 166 (+155.38%)
OccaJIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal
Stars: ✭ 230 (+253.85%)
CUDAfy.NETCUDAfy .NET allows easy development of high performance GPGPU applications completely from the .NET. It's developed in C#.
Stars: ✭ 56 (-13.85%)
SixtyfourHow fast can we brute force a 64-bit comparison?
Stars: ✭ 41 (-36.92%)
ctuning-programsCollective Knowledge extension with unified and customizable benchmarks (with extensible JSON meta information) to be easily integrated with customizable and portable Collective Knowledge workflows. You can easily compile and run these benchmarks using different compilers, environments, hardware and OS (Linux, MacOS, Windows, Android). More info:
Stars: ✭ 41 (-36.92%)
SoliditySHA3MinerAll-in-one mixed multi-GPU (nVidia, AMD, Intel) & CPU miner solves proof of work to mine supported EIP918 tokens in a single instance (with API).
Stars: ✭ 28 (-56.92%)
hipaccA domain-specific language and compiler for image processing
Stars: ✭ 72 (+10.77%)
ClvkExperimental implementation of OpenCL on Vulkan
Stars: ✭ 158 (+143.08%)
KhivaAn open-source library of algorithms to analyse time series in GPU and CPU.
Stars: ✭ 161 (+147.69%)
BohriumAutomatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Stars: ✭ 209 (+221.54%)
FastA framework for GPU based high-performance medical image processing and visualization
Stars: ✭ 179 (+175.38%)
HeCBenchsoftware.intel.com/content/www/us/en/develop/articles/repo-evaluating-performance-productivity-oneapi.html
Stars: ✭ 85 (+30.77%)
PycudaCUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+1610.77%)
gardeniaGARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators
Stars: ✭ 22 (-66.15%)
Soul EnginePhysically based renderer and simulation engine for real-time applications.
Stars: ✭ 37 (-43.08%)
Trisycl Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group
Stars: ✭ 354 (+444.62%)
Arrayfire PythonPython bindings for ArrayFire: A general purpose GPU library.
Stars: ✭ 358 (+450.77%)
IlgpuILGPU JIT Compiler for high-performance .Net GPU programs
Stars: ✭ 374 (+475.38%)
Cuda Api WrappersThin C++-flavored wrappers for the CUDA Runtime API
Stars: ✭ 362 (+456.92%)
AmgclC++ library for solving large sparse linear systems with algebraic multigrid method
Stars: ✭ 390 (+500%)
BitcrackerBitCracker is the first open source password cracking tool for memory units encrypted with BitLocker
Stars: ✭ 463 (+612.31%)
CompactcnncascadeA binary library for very fast face detection using compact CNNs.
Stars: ✭ 152 (+133.85%)
NnvmNo description or website provided.
Stars: ✭ 1,639 (+2421.54%)
PrimitivA Neural Network Toolkit.
Stars: ✭ 164 (+152.31%)
MixbenchA GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL)
Stars: ✭ 130 (+100%)
Pine🌲 Aimbot powered by real-time object detection with neural networks, GPU accelerated with Nvidia. Optimized for use with CS:GO.
Stars: ✭ 202 (+210.77%)
Ck CaffeCollective Knowledge workflow for Caffe to automate installation across diverse platforms and to collaboratively evaluate and optimize Caffe-based workloads across diverse hardware, software and data sets (compilers, libraries, tools, models, inputs):
Stars: ✭ 192 (+195.38%)
gpuowlGPU Mersenne primality test.
Stars: ✭ 77 (+18.46%)
BabelstreamSTREAM, for lots of devices written in many programming models
Stars: ✭ 121 (+86.15%)
EtalerA flexable HTM (Hierarchical Temporal Memory) framework with full GPU support.
Stars: ✭ 79 (+21.54%)
dlprimitivesDeep Learning Primitives and Mini-Framework for OpenCL
Stars: ✭ 65 (+0%)
HeteroflowConcurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-12.31%)
learn-gpgpuAlgorithms implemented in CUDA + resources about GPGPU
Stars: ✭ 37 (-43.08%)
rbcudaCUDA bindings for Ruby
Stars: ✭ 57 (-12.31%)
GOSHAn ultra-fast, GPU-based large graph embedding algorithm utilizing a novel coarsening algorithm requiring not more than a single GPU.
Stars: ✭ 12 (-81.54%)
MatXAn efficient C++17 GPU numerical computing library with Python-like syntax
Stars: ✭ 418 (+543.08%)
SpocStream Processing with OCaml
Stars: ✭ 115 (+76.92%)
ArrayfireArrayFire: a general purpose GPU library.
Stars: ✭ 3,693 (+5581.54%)
TutorialsSome basic programming tutorials
Stars: ✭ 353 (+443.08%)
BlendluxcoreBlender Integration for LuxCore
Stars: ✭ 287 (+341.54%)
Nvidia libs testTests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-44.62%)
LoopyA code generator for array-based code on CPUs and GPUs
Stars: ✭ 367 (+464.62%)
ClojureclClojureCL is a Clojure library for parallel computations with OpenCL.
Stars: ✭ 266 (+309.23%)
KttKernel Tuning Toolkit
Stars: ✭ 33 (-49.23%)
Stdgpustdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+716.92%)
VexclVexCL is a C++ vector expression template library for OpenCL/CUDA/OpenMP
Stars: ✭ 626 (+863.08%)
JuiceThe Hacker's Machine Learning Engine
Stars: ✭ 743 (+1043.08%)
Futhark💥💻💥 A data-parallel functional programming language
Stars: ✭ 1,641 (+2424.62%)
CltuneCLTune: An automatic OpenCL & CUDA kernel tuner
Stars: ✭ 114 (+75.38%)
crowdsource-video-experiments-on-androidCrowdsourcing video experiments (such as collaborative benchmarking and optimization of DNN algorithms) using Collective Knowledge Framework across diverse Android devices provided by volunteers. Results are continuously aggregated in the open repository:
Stars: ✭ 29 (-55.38%)
Xray Oxygen🌀 Oxygen Engine 2.0. [Preview] Discord: https://discord.gg/P3aMf66
Stars: ✭ 481 (+640%)
AccelerateEmbedded language for high-performance array computations
Stars: ✭ 751 (+1055.38%)
PyopenclOpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+1115.38%)