Top 527 CUDA open source projects

Speedtorch
Library for faster pinned CPU <-> GPU transfer in PyTorch
Thundergbm
ThunderGBM: Fast GBDTs and Random Forests on GPUs
Trtorch
PyTorch/TorchScript compiler for NVIDIA GPUs using TensorRT
Xmrig Nvidia
Monero (XMR) NVIDIA miner
Cudasift
A CUDA implementation of SIFT for NVIDIA GPUs (1.2 ms on a GTX 1060)
Cudamat
Python module for performing basic dense linear algebra computations on the GPU using CUDA.
Nvparse
Fast, GPU-based CSV parser
✭ 533
cuda
Stdgpu
stdgpu: Efficient STL-like Data Structures on the GPU
Depthwiseconvolution
A personal depthwise convolution layer implementation on Caffe by liuhao (GPU only).
Rustacuda
Rusty wrapper for the CUDA Driver API
✭ 511
rust gpu cuda
Convnet
A GPU implementation of Convolutional Neural Nets in C++
✭ 506
cuda
Lightseq
LightSeq: A High Performance Inference Library for Sequence Processing and Generation
Xray Oxygen
🌀 Oxygen Engine 2.0. [Preview] Discord: https://discord.gg/P3aMf66
Tsdf Fusion Python
Python code to fuse multiple RGB-D images into a TSDF voxel volume.
Bitcracker
BitCracker is the first open source password cracking tool for memory units encrypted with BitLocker
Tsdf Fusion
Fuse multiple depth frames into a TSDF voxel volume.
Tensorflow Cmake
TensorFlow examples in C, C++, Go and Python without Bazel but with CMake and FindTensorFlow.cmake
Accel
(Mirror of GitLab) GPGPU Framework for Rust
Icpcuda
Super fast implementation of ICP in CUDA for devices with compute capability 3.5 or higher
✭ 416
cuda
Deformable Convolution Pytorch
PyTorch implementation of Deformable Convolution
✭ 410
cuda
Pytorch Pwc
A reimplementation of PWC-Net in PyTorch that matches the official Caffe version
Cubert
Fast implementation of BERT inference directly on NVIDIA (CUDA, cuBLAS) and Intel MKL
Integral Human Pose
Integral Human Pose Regression
✭ 395
cuda
Cudanative.jl
Julia support for native CUDA programming
✭ 393
julia cuda
Ganet
GA-Net: Guided Aggregation Net for End-to-end Stereo Matching
✭ 393
cuda
Neuralnetwork.net
A TensorFlow-inspired neural network library built from scratch in C# 7.3 for .NET Standard 2.0, with GPU support through cuDNN
Amgcl
C++ library for solving large sparse linear systems with algebraic multigrid method
Music Translation
A Universal Music Translation Network: a method for translating music across musical instruments and styles.
✭ 385
cuda
Hipsycl
Implementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Ilgpu
ILGPU JIT Compiler for high-performance .NET GPU programs
Cuda.jl
CUDA programming in Julia.
✭ 370
julia gpu cuda
Nvpipe
NVIDIA-accelerated zero latency video compression library for interactive remoting applications
✭ 376
cuda
Vuda
VUDA is a header-only library based on Vulkan that provides a CUDA Runtime API interface for writing GPU-accelerated applications.
✭ 373
cuda vulkan
Mini Caffe
Minimal runtime core of Caffe: forward only, with GPU support and memory efficiency.
Libsgm
Stereo Semi-Global Matching with CUDA
✭ 368
cuda
Darkpose
Distribution-Aware Coordinate Representation for Human Pose Estimation
Cuda Api Wrappers
Thin C++-flavored wrappers for the CUDA Runtime API
Arrayfire Python
Python bindings for ArrayFire: A general purpose GPU library.
K2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
✭ 354
cuda
Tutorials
Some basic programming tutorials
Sleef
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Visionaray
A C++-based, cross platform ray tracing library
Cudahandbook
Source code that accompanies The CUDA Handbook.
✭ 345
cuda
Cudpp
CUDA Data Parallel Primitives Library
✭ 333
cuda