OccaJIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal
Stars: ✭ 230 (-35.75%)
ArrayfireArrayFire: a general purpose GPU library.
Stars: ✭ 3,693 (+931.56%)
ParenchymaAn extensible HPC framework for CUDA, OpenCL and native CPU.
Stars: ✭ 71 (-80.17%)
Futhark💥💻💥 A data-parallel functional programming language
Stars: ✭ 1,641 (+358.38%)
IlgpuILGPU JIT Compiler for high-performance .Net GPU programs
Stars: ✭ 374 (+4.47%)
HipsyclImplementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+5.31%)
MatXAn efficient C++17 GPU numerical computing library with Python-like syntax
Stars: ✭ 418 (+16.76%)
BitcrackerBitCracker is the first open source password cracking tool for memory units encrypted with BitLocker
Stars: ✭ 463 (+29.33%)
NeanderthalFast Clojure Matrix Library
Stars: ✭ 927 (+158.94%)
ComputeA C++ GPU Computing Library for OpenCL
Stars: ✭ 1,192 (+232.96%)
AparapiThe New Official Aparapi: a framework for executing native Java and Scala code on the GPU.
Stars: ✭ 352 (-1.68%)
SpocStream Processing with OCaml
Stars: ✭ 115 (-67.88%)
BayaderaHigh-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (-4.47%)
OnemkloneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (-65.92%)
JohnJohn the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs
Stars: ✭ 5,656 (+1479.89%)
Compute RuntimeIntel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver
Stars: ✭ 593 (+65.64%)
KttKernel Tuning Toolkit
Stars: ✭ 33 (-90.78%)
AmgclC++ library for solving large sparse linear systems with algebraic multigrid method
Stars: ✭ 390 (+8.94%)
ArraymancerA fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+121.51%)
gpubootcampThis repository consists for gpu bootcamp material for HPC and AI
Stars: ✭ 227 (-36.59%)
VexclVexCL is a C++ vector expression template library for OpenCL/CUDA/OpenMP
Stars: ✭ 626 (+74.86%)
MixbenchA GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL)
Stars: ✭ 130 (-63.69%)
HashcatWorld's fastest and most advanced password recovery utility
Stars: ✭ 11,014 (+2976.54%)
allgebraBase container for developing C++ and Fortran HPC applications
Stars: ✭ 14 (-96.09%)
cuda memtestFork of CUDA GPU memtest 👓
Stars: ✭ 68 (-81.01%)
Stdgpustdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+48.32%)
PyopenclOpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+120.67%)
BabelstreamSTREAM, for lots of devices written in many programming models
Stars: ✭ 121 (-66.2%)
monolishmonolish: MONOlithic LInear equation Solvers for Highly-parallel architecture
Stars: ✭ 166 (-53.63%)
hipaccA domain-specific language and compiler for image processing
Stars: ✭ 72 (-79.89%)
Optical Flow FilterA real time optical flow algorithm implemented on GPU
Stars: ✭ 146 (-59.22%)
KhivaAn open-source library of algorithms to analyse time series in GPU and CPU.
Stars: ✭ 161 (-55.03%)
BohriumAutomatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Stars: ✭ 209 (-41.62%)
PrimitivA Neural Network Toolkit.
Stars: ✭ 164 (-54.19%)
Cuda Api WrappersThin C++-flavored wrappers for the CUDA Runtime API
Stars: ✭ 362 (+1.12%)
CekirdeklerMulti-device OpenCL kernel load balancer and pipeliner API for C#. Uses shared-distributed memory model to keep GPUs updated fast while using same kernel on all devices(for simplicity).
Stars: ✭ 76 (-78.77%)
CupochRobotics with GPU computing
Stars: ✭ 225 (-37.15%)
Awesome CudaThis is a list of useful libraries and resources for CUDA development.
Stars: ✭ 274 (-23.46%)
gpuowlGPU Mersenne primality test.
Stars: ✭ 77 (-78.49%)
CUDAfy.NETCUDAfy .NET allows easy development of high performance GPGPU applications completely from the .NET. It's developed in C#.
Stars: ✭ 56 (-84.36%)
PlotoptixData visualisation in Python based on OptiX 7.2 ray tracing framework.
Stars: ✭ 252 (-29.61%)
ThrustThe C++ parallel algorithms library.
Stars: ✭ 3,595 (+904.19%)
Amplifier.NETAmplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPU/GPU, NVIDIA, AMD without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.
Stars: ✭ 142 (-60.34%)
learn-gpgpuAlgorithms implemented in CUDA + resources about GPGPU
Stars: ✭ 37 (-89.66%)
Fast gicpA collection of GICP-based fast point cloud registration algorithms
Stars: ✭ 307 (-14.25%)
CLUThe OpenCL Utility library
Stars: ✭ 18 (-94.97%)
SenetSqueeze-and-Excitation Networks
Stars: ✭ 2,850 (+696.09%)
hpcLearning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )
Stars: ✭ 39 (-89.11%)
rectdetectRealtime rectangle detector with GPGPU
Stars: ✭ 51 (-85.75%)
WebclglGPGPU Javascript library 🐸
Stars: ✭ 313 (-12.57%)
pystellaA code generator for grid-based PDE solving on CPUs and GPUs
Stars: ✭ 18 (-94.97%)
RayTracingRealtime GPU Path tracer based on OpenCL and OpenGL
Stars: ✭ 120 (-66.48%)
nodeGPU-accelerated data science and visualization in node
Stars: ✭ 85 (-76.26%)
HiSpatialClusterClustering spatial points with algorithm of Fast Search, high performace computing implements of CUDA or parallel in CPU, and runnable implements on python standalone or arcgis.
Stars: ✭ 31 (-91.34%)
sycl-benchSYCL Benchmark Suite
Stars: ✭ 30 (-91.62%)
dbcsrDBCSR: Distributed Block Compressed Sparse Row matrix library
Stars: ✭ 65 (-81.84%)
ctuning-programsCollective Knowledge extension with unified and customizable benchmarks (with extensible JSON meta information) to be easily integrated with customizable and portable Collective Knowledge workflows. You can easily compile and run these benchmarks using different compilers, environments, hardware and OS (Linux, MacOS, Windows, Android). More info:
Stars: ✭ 41 (-88.55%)