ArrayfireArrayFire: a general purpose GPU library.
3GPU-accelerated micromagnetic simulator
JitifyA single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).
ThrustThe C++ parallel algorithms library.
Fast gicpA collection of GICP-based fast point cloud registration algorithms
Knn CudaFast k nearest neighbor search using GPU
Person Reid ganICCV2017 Unlabeled Samples Generated by GAN Improve the Person Re-identification Baseline in vitro
Cuda voxelizerCUDA Voxelizer to convert polygon meshes into annotated voxel grids
KomputationKomputation is a neural network framework for the Java Virtual Machine written in Kotlin and CUDA C.
Ffmpeg Build ScriptThe FFmpeg build script provides an easy way to build a static FFmpeg on OSX and Linux with non-free codecs included.
Open quadtree mappingThis is a monocular dense mapping system corresponding to IROS 2018 "Quadtree-accelerated Real-time Monocular Dense Mapping"
Pytorch Liteflownet a reimplementation of LiteFlowNet in PyTorch that matches the official Caffe version
Tensor StreamA library for real-time video stream decoding to CUDA memory
Awesome CudaThis is a list of useful libraries and resources for CUDA development.
FbcudaFacebook's CUDA extensions.
HemiSimple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.
GprmaxgprMax is open source software that simulates electromagnetic wave propagation using the Finite-Difference Time-Domain (FDTD) method for numerical modelling of Ground Penetrating Radar (GPR)
Dynamicfusion Implementation of Newcombe et al. CVPR 2015 DynamicFusion paper
Kinectfusionlib Implementation of the KinectFusion approach in modern C++14 and CUDA
BrainsimulatorBrain Simulator is a platform for visual prototyping of artificial intelligence architectures.
PopsiftPopSift is an implementation of the SIFT algorithm in CUDA.
OneflowOneFlow is a performance-centered and open-source deep learning framework.
instant-ngpInstant neural graphics primitives: lightning fast NeRF and more
gpu-monitorScript to remotely check GPU servers for free GPUs
LuisaRenderHigh-Performance Multiple-Backend Renderer Based on LuisaCompute
Torch-TensorRTPyTorch/TorchScript compiler for NVIDIA GPUs using TensorRT
PyTorchTOPGPU PyTorch TOP in TouchDesigner with CUDA-enabled OpenCV
crowdsource-video-experiments-on-androidCrowdsourcing video experiments (such as collaborative benchmarking and optimization of DNN algorithms) using Collective Knowledge Framework across diverse Android devices provided by volunteers. Results are continuously aggregated in the open repository:
desertA fast (?) random sampling drawing library
opencv-cuda-dockerDockerfiles for OpenCV compiled with CUDA, opencv_contrib modules and Python 3 bindings
CPP-ProgrammingVarious C/C++ examples. DirectX, OpenGL, CUDA, Vulkan, OpenCL.
mbsolveAn open-source solver tool for the Maxwell-Bloch equations.
hipaccA domain-specific language and compiler for image processing
tiny-cuda-nnLightning fast & tiny C++/CUDA neural network framework
CudaSHA256Simple tool to calculate sha256 on GPU using Cuda
QPT[内测中]前向式Python环境快捷封装工具,快速将Python打包为EXE并添加CUDA、NoAVX等支持。
revisiting-sepconvan implementation of Revisiting Adaptive Convolutions for Video Frame Interpolation using PyTorch
octotigerAstrophysics program simulating the evolution of star systems based on the fast multipole method on adaptive Octrees
lbvhan implementation of parallel linear BVH (LBVH) on GPU
ThrustRTCCUDA tool set for non-C++ languages that provides similar functionality like Thrust, with NVRTC at its core.
disptoolsGenerate displacement fields with known volume changes
mini-nbodyA simple gravitational N-body simulation in less than 100 lines of C code, with CUDA optimizations.
MatXAn efficient C++17 GPU numerical computing library with Python-like syntax
dynamic-occupancy-grid-mapImplementation of A Random Finite Set Approach for Dynamic Occupancy Grid Maps with Real-Time Application
cressetTemplate repository to build PyTorch projects from source on any version of PyTorch/CUDA/cuDNN.
warpcontinuous energy monte carlo neutron transport in general geometries on GPUs
PbfVsImplementation of Macklin, Miles, and Matthias Müller. "Position based fluids.". Visual Studio 2015 + CUDA 8.0
bazel.cmakebazel.cmake mimics the behavior of bazel to simplify the usability of CMake