SixtyfourHow fast can we brute force a 64-bit comparison?
Octree SlamLarge octree map construction and rendering with CUDA and OpenGL
NbodyN body gravity attraction problem solver
Soul EnginePhysically based renderer and simulation engine for real-time applications.
Smallpt Parallel Bvh GpuA GPU implementation of smallpt (http://www.kevinbeason.com/smallpt/) with Bounding Volume Hierarchy (BVH) tree.
Nvidia libs testTests and benchmarks for cudnn (and in the future, other nvidia libraries)
Object Detection And Location Realsensed435Use the Intel D435 real-sensing camera to realize target detection based on the Yolov3 framework under the Opencv DNN framework, and realize the 3D positioning of the Objection according to the depth information. Real-time display of the coordinates in the camera coordinate system.ADD--Using Yolov5 By TensorRT model,AGX-Xavier,RealTime Object Detection
Simple Sh DatascienceA collection of Bash scripts and Dockerfiles to install data science Tool, Lib and application
CudaExperiments with CUDA and Rust
Cuda word splitThis project is an old code for Chinese words split. It is written by CUDA at 2010, so it could not run well directly under you platform without an GPU card.
Cuda CnnImplementation of a simple CNN using CUDA
Des CudaDES cracking using brute force algorithm and CUDA
CubCooperative primitives for CUDA C++.
GraphviteGraphVite: A General and High-performance Graph Embedding System
UammdA CUDA project for Molecular Dynamics, Brownian Dynamics, Hydrodynamics... intended to simulate a very generic system constructing a simulation with modules.
Stn3d3D Spatial Transformer Network
CupoissonCUDA implementation of the 2D fast Poisson solver
ThorAtmospheric fluid dynamics solver optimized for GPUs.
Lattice netFast Point Cloud Segmentation Using Permutohedral Lattices
Sepconv Slomoan implementation of Video Frame Interpolation via Adaptive Separable Convolution using PyTorch
CudajacobiCUDA implementation of the Jacobi method
NeuralsuperresolutionReal-time video quality improvement for applications such as video-chat using Perceptual Losses
WheelsPerformance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
GmatrixR package for unleashing the power of NVIDIA GPU's
Ddsh Tip2018source code for paper "Deep Discrete Supervised Hashing"
CudadbclusteringClustering via Graphics Processor, using NVIDIA CUDA sdk to preform database clustering on the massively parallel graphics card processor
LibcudarangeAn interval arithmetic and affine arithmetic library for NVIDIA CUDA
Pytorch Losslabel-smooth, amsoftmax, focal-loss, triplet-loss, lovasz-softmax. Maybe useful
BlocksparseEfficient GPU kernels for block-sparse matrix multiplication and convolution
ArraymancerA fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
PyopenclOpenCL integration for Python, plus shiny features
NumbaNumPy aware dynamic Python compiler using LLVM
MarianFast Neural Machine Translation in C++
Ethereum nvidia miner💰 USB flash drive ISO image for Ethereum, Zcash and Monero mining with NVIDIA graphics cards and Ubuntu GNU/Linux (headless)
AccelerateEmbedded language for high-performance array computations
JuiceThe Hacker's Machine Learning Engine
KintinuousReal-time large scale dense visual SLAM system
GunrockHigh-Performance Graph Primitives on GPUs
Cuda Convnet2Automatically exported from code.google.com/p/cuda-convnet2
Nv WavenetReference implementation of real-time autoregressive wavenet inference
ChainerA flexible framework of neural networks for deep learning
Mc CnnStereo Matching by Training a Convolutional Neural Network to Compare Image Patches
SlangMaking it easier to work with shaders
KmcudaLarge scale K-means and K-nn implementation on NVIDIA GPU / CUDA
VexclVexCL is a C++ vector expression template library for OpenCL/CUDA/OpenMP
TwostreamfusionCode release for "Convolutional Two-Stream Network Fusion for Video Action Recognition", CVPR 2016.