Top 1817 Cuda open source projects

701. merge-spmv
No description, website, or topics provided.
✭ 62
CudaC++
702. CUDAAdvisor
CUDAAdvisor: a GPU profiling tool
703. Trans2Seg
No description, website, or topics provided.
✭ 130
pythonCuda
704. dynamic-training-with-apache-mxnet-on-aws
Dynamic training with Apache MXNet reduces cost and time for training deep neural networks by leveraging AWS cloud elasticity and scale. The system reduces training cost and time by dynamically updating the training cluster size during training, with minimal impact on model training accuracy.
705. CSPN monodepth
Unofficial Faster PyTorch implementation of Convolutional Spatial Propagation Network
706. cuda spatial deform
A fast tool to do image augmentation on GPU(especially elastic_deform), can be helpful to research on Medical Image.
708. alien
ALIEN is a CUDA-powered artificial life simulation program.
709. AutoDiff
Julia AutoDiff
✭ 28
juliaCuda
711. dpMMlowVar
Bayesian nonparametric small-variance asymptotic clustering algorithms
714. mcx
Monte Carlo eXtreme (MCX) - GPU-accelerated photon transport simulator
715. StacklessRayTracer
Implemented stackless KDTree on GPU to accelerate ray tracing rendering algorithm. Hardware level optimizations for register spills local memory overhead are used.
716. LMCNet
PyTorch implementation of "Learnable Motion Coherence for Correspondence Pruning" CVPR 2021.
717. NCCL
Windows version of NVIDIA's NCCL ('Nickel') for multi-GPU training - please use https://github.com/NVIDIA/nccl for changes.
✭ 21
CudaC++c
718. fasterGICP
an improvement of fast_gicp
720. istar
istar is a software-as-a-service platform for bioinformatics and chemoinformatics.
721. ROSEFusion
[SIGGRAPH 2021] ROSEFusion is proposed to tackle the difficulties in fast-motion camera tracking using random optimization with depth information only.
722. udacity-IntroToParallelProgramming
CS344 - Introduction To Parallel Programming course (Udacity) proposed solutions
724. gslam bowbench
Comparison of DBoW2, DBoW3, FBoW and Vocabulary of GSLAM.
725. TAL SH
Tensor Algebra Library Routines for Shared Memory Systems
727. PartNet
The source code for the TMM paper: Part-Aware Fine-grained Object Categorization using Weakly Supervised Part Detection Network
729. z0
No description, website, or topics provided.
731. bpr
Bayesian Personalized Ranking using PyTorch
732. outdoor-blind-navigation
Open-Source Sidewalk Navigation Software for Visually-Impaired Individuals using Multithreaded CNN’s
734. Shadow-Removal-via-Generative-Priors
[ACM MM 2021 Oral] Unsupervised Portrait Shadow Removal via Generative Priors
735. GVV-Differentiable-CUDA-Renderer
Differentiable Rasterization-based Renderer implemented in CUDA and C++
736. Yolo on Caffe
Yolo(including yolov1 yolov2 yolov3)running on caffe windows. Anyone that is not familiar with linux can use this project to learn caffe developing
737. yolo cpp
C++ed version of Yolo
✭ 68
C++cCuda
739. cmake-cuda-example
Example of how to use CUDA with CMake >= 3.8
740. 3DFacePointCloudNet
Learning Directly from Synthetic Point Clouds for "In-the-wild" 3D Face Recognition
741. sparse-detr
PyTorch Implementation of Sparse DETR
743. learning with noise
Learning to See by Looking at Noise
744. legate.pandas
An Aspiring Drop-In Replacement for Pandas at Scale
745. OSCAR
Code for ICML 2021 paper: How could Neural Networks understand Programs?
746. warpcore
A Library for fast Hash Tables on GPUs
✭ 61
C++Cuda
749. uoais
Codes of paper "Unseen Object Amodal Instance Segmentation via Hierarchical Occlusion Modeling", ICRA 2022
750. VersaPipe
A framework for pipelined computing on GPU