CreepminerBurstcoin C++ CPU and GPU Miner
Stars: ✭ 169 (+1.81%)
GenomeworksSDK for GPU accelerated genome assembly and analysis
Stars: ✭ 215 (+29.52%)
BohriumAutomatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Stars: ✭ 209 (+25.9%)
LibcudacxxThe C++ Standard Library for your entire system.
Stars: ✭ 1,861 (+1021.08%)
TimemoryModular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template API is essentially a framework to creating tools: it is designed to provide a unifying interface for recording various performance measurements alongside data logging and interfaces to other tools.
Stars: ✭ 192 (+15.66%)
MixbenchA GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL)
Stars: ✭ 130 (-21.69%)
Batch ShipyardSimplify HPC and Batch workloads on Azure
Stars: ✭ 240 (+44.58%)
dbcsrDBCSR: Distributed Block Compressed Sparse Row matrix library
Stars: ✭ 65 (-60.84%)
OccaJIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal
Stars: ✭ 230 (+38.55%)
PrimitivA Neural Network Toolkit.
Stars: ✭ 164 (-1.2%)
peakperfAchieve peak performance on x86 CPUs and NVIDIA GPUs
Stars: ✭ 33 (-80.12%)
bifrostA stream processing framework for high-throughput applications.
Stars: ✭ 48 (-71.08%)
HeteroflowConcurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-65.66%)
PbfVsImplementation of Macklin, Miles, and Matthias Müller. "Position based fluids.". Visual Studio 2015 + CUDA 8.0
Stars: ✭ 100 (-39.76%)
Fat-CloudsGPU Fluid Simulation with Volumetric Rendering
Stars: ✭ 81 (-51.2%)
lbvhan implementation of parallel linear BVH (LBVH) on GPU
Stars: ✭ 67 (-59.64%)
cuda memtestFork of CUDA GPU memtest 👓
Stars: ✭ 68 (-59.04%)
opencv-cuda-dockerDockerfiles for OpenCV compiled with CUDA, opencv_contrib modules and Python 3 bindings
Stars: ✭ 55 (-66.87%)
hipaccA domain-specific language and compiler for image processing
Stars: ✭ 72 (-56.63%)
PopsiftPopSift is an implementation of the SIFT algorithm in CUDA.
Stars: ✭ 259 (+56.02%)
FLAMEGPU2FLAME GPU 2 is a GPU accelerated agent based modelling framework for C++ and Python
Stars: ✭ 25 (-84.94%)
Deep DiamondA fast Clojure Tensor & Deep Learning library
Stars: ✭ 288 (+73.49%)
Awesome CudaThis is a list of useful libraries and resources for CUDA development.
Stars: ✭ 274 (+65.06%)
OnemkloneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (-26.51%)
HemiSimple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.
Stars: ✭ 275 (+65.66%)
BayaderaHigh-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (+106.02%)
ArrayfireArrayFire: a general purpose GPU library.
Stars: ✭ 3,693 (+2124.7%)
Cuda Api WrappersThin C++-flavored wrappers for the CUDA Runtime API
Stars: ✭ 362 (+118.07%)
Open3dOpen3D: A Modern Library for 3D Data Processing
Stars: ✭ 5,860 (+3430.12%)
H2o4gpuH2Oai GPU Edition
Stars: ✭ 416 (+150.6%)
RustacudaRusty wrapper for the CUDA Driver API
Stars: ✭ 511 (+207.83%)
AmgclC++ library for solving large sparse linear systems with algebraic multigrid method
Stars: ✭ 390 (+134.94%)
Stdgpustdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+219.88%)
CudasiftA CUDA implementation of SIFT for NVidia GPUs (1.2 ms on a GTX 1060)
Stars: ✭ 555 (+234.34%)
CudfcuDF - GPU DataFrame Library
Stars: ✭ 4,370 (+2532.53%)
GunrockHigh-Performance Graph Primitives on GPUs
Stars: ✭ 718 (+332.53%)
ChainerA flexible framework of neural networks for deep learning
Stars: ✭ 5,656 (+3307.23%)
Scikit CudaPython interface to GPU-powered libraries
Stars: ✭ 803 (+383.73%)
SpeedtorchLibrary for faster pinned CPU <-> GPU transfer in Pytorch
Stars: ✭ 615 (+270.48%)
CubCooperative primitives for CUDA C++.
Stars: ✭ 883 (+431.93%)
GraphviteGraphVite: A General and High-performance Graph Embedding System
Stars: ✭ 865 (+421.08%)
Qualia2.0Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Stars: ✭ 41 (-75.3%)
HipsyclImplementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+127.11%)
Tsne CudaGPU Accelerated t-SNE for CUDA with Python bindings
Stars: ✭ 1,120 (+574.7%)
PycudaCUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+569.88%)
ParenchymaAn extensible HPC framework for CUDA, OpenCL and native CPU.
Stars: ✭ 71 (-57.23%)
Futhark💥💻💥 A data-parallel functional programming language
Stars: ✭ 1,641 (+888.55%)
Cuda Design PatternsSome CUDA design patterns and a bit of template magic for CUDA
Stars: ✭ 78 (-53.01%)
IlgpuILGPU JIT Compiler for high-performance .Net GPU programs
Stars: ✭ 374 (+125.3%)
Carlsim3CARLsim is an efficient, easy-to-use, GPU-accelerated software framework for simulating large-scale spiking neural network (SNN) models with a high degree of biological detail.
Stars: ✭ 52 (-68.67%)
Deeppipe2Deep Learning library using GPU(CUDA/cuBLAS)
Stars: ✭ 90 (-45.78%)
PygraphistryPyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer
Stars: ✭ 1,365 (+722.29%)
Xmrminer🐜 A CUDA based miner for Monero
Stars: ✭ 158 (-4.82%)
Partial Order PruningPartial Order Pruning: for Best Speed/Accuracy Trade-off in Neural Architecture Search
Stars: ✭ 135 (-18.67%)
GinkgoNumerical linear algebra software package
Stars: ✭ 149 (-10.24%)
DashDASH, the C++ Template Library for Distributed Data Structures with Support for Hierarchical Locality for HPC and Data-Driven Science
Stars: ✭ 134 (-19.28%)