Deeppipe2 - Deep Learning library using GPU (CUDA/cuBLAS)
Halloc - A fast and highly scalable GPU dynamic memory allocator
Thundersvm - ThunderSVM: A Fast SVM Library on GPUs and CPUs
Minhashcuda - Weighted MinHash implementation on CUDA (multi-GPU).
Python Opencv Cuda - Custom opencv_contrib module that exposes OpenCV CUDA optical flow methods with Python bindings
Mpr - Reference implementation for "Massively Parallel Rendering of Complex Closed-Form Implicit Surfaces" (SIGGRAPH 2020)
Pytorch Emdloss - PyTorch 1.0 implementation of the approximate Earth Mover's Distance
Modulated Deform Conv - Modulated deformable convolution (2D and 3D) for PyTorch with CUDA
Hiop - HPC solver for nonlinear optimization problems
Titan - A high-performance CUDA-based physics simulation sandbox for soft robotics and reinforcement learning.
Parenchyma - An extensible HPC framework for CUDA, OpenCL and native CPU.
Arboretum - Gradient Boosting powered by GPU (NVIDIA CUDA)
Cudadrv.jl - A Julia wrapper for the CUDA driver API.
Mpn Cov - @ICCV2017: For exploiting second-order statistics, we propose Matrix Power Normalized Covariance pooling (MPN-COV) ConvNets, different from and outperforming those using global average pooling.
Cutlass - CUDA Templates for Linear Algebra Subroutines
Ggnn - GGNN: State of the Art Graph-based GPU Nearest Neighbor Search
Tsne Cuda - GPU Accelerated t-SNE for CUDA with Python bindings
Minkowskiengine - Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors
Pycuda - CUDA integration for Python, plus shiny features
Mpm - GPU simulation using the Material Point Method, with rendering.
Flattened Cnn - Flattened convolutional neural networks (1D convolution modules for Torch nn)
Dokai - Collection of Docker images for ML/DL and video processing projects
Heteroflow - Concurrent CPU-GPU Programming using Task Models
Cuda Samples - Samples for CUDA developers that demonstrate features in the CUDA Toolkit
Nvbio Gpl - NVBIO is a library of reusable components designed to accelerate bioinformatics applications using CUDA.
Hzproc - Torch data augmentation toolbox (supports affine transform)
Dink - Point cloud deep learning framework
3d Ken Burns - An implementation of 3D Ken Burns Effect from a Single Image using PyTorch
Pamtri - PAMTRI: Pose-Aware Multi-Task Learning for Vehicle Re-Identification (ICCV 2019) - Official PyTorch Implementation
Carlsim3 - CARLsim is an efficient, easy-to-use, GPU-accelerated software framework for simulating large-scale spiking neural network (SNN) models with a high degree of biological detail.
Hungariangpu - A GPU/CUDA implementation of the Hungarian algorithm
Cs344 - Introduction to Parallel Programming class code
Hornet - Hornet data structure for sparse dynamic graphs and matrices
Qualia2.0 - Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.