Cuda SamplesSamples for CUDA Developers which demonstrates features in CUDA Toolkit
Stars: ✭ 1,087 (+376.75%)
Deep DiamondA fast Clojure Tensor & Deep Learning library
Stars: ✭ 288 (+26.32%)
JcudaJCuda - Java bindings for CUDA
Stars: ✭ 165 (-27.63%)
Pytorch Liteflownet a reimplementation of LiteFlowNet in PyTorch that matches the official Caffe version
Stars: ✭ 281 (+23.25%)
Hzproctorch data augmentation toolbox (supports affine transform)
Stars: ✭ 56 (-75.44%)
Awesome CudaThis is a list of useful libraries and resources for CUDA development.
Stars: ✭ 274 (+20.18%)
FcisFully Convolutional Instance-aware Semantic Segmentation
Stars: ✭ 1,563 (+585.53%)
HemiSimple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.
Stars: ✭ 275 (+20.61%)
3d Ken Burnsan implementation of 3D Ken Burns Effect from a Single Image using PyTorch
Stars: ✭ 1,073 (+370.61%)
Go CyberYour 🔵 Superintelligence
Stars: ✭ 270 (+18.42%)
Pedestrian alignmentTCSVT2018 Pedestrian Alignment Network for Large-scale Person Re-identification
Stars: ✭ 223 (-2.19%)
Dynamicfusion Implementation of Newcombe et al. CVPR 2015 DynamicFusion paper
Stars: ✭ 267 (+17.11%)
PamtriPAMTRI: Pose-Aware Multi-Task Learning for Vehicle Re-Identification (ICCV 2019) - Official PyTorch Implementation
Stars: ✭ 53 (-76.75%)
BrainsimulatorBrain Simulator is a platform for visual prototyping of artificial intelligence architectures.
Stars: ✭ 262 (+14.91%)
OnemkloneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (-46.49%)
instant-ngpInstant neural graphics primitives: lightning fast NeRF and more
Stars: ✭ 1,863 (+717.11%)
HungariangpuAn GPU/CUDA implementation of the Hungarian algorithm
Stars: ✭ 51 (-77.63%)
LuisaRenderHigh-Performance Multiple-Backend Renderer Based on LuisaCompute
Stars: ✭ 47 (-79.39%)
Multi Gpu Programming ModelsExamples demonstrating available options to program multiple GPUs in a single node or a cluster
Stars: ✭ 165 (-27.63%)
HornetHornet data structure for sparse dynamic graphs and matrices
Stars: ✭ 49 (-78.51%)
PyTorchTOPGPU PyTorch TOP in TouchDesigner with CUDA-enabled OpenCV
Stars: ✭ 58 (-74.56%)
Knn cudaFast K-Nearest Neighbor search with GPU
Stars: ✭ 119 (-47.81%)
cuda2GLcoreImplementation of Cuda to OpenGL rendering
Stars: ✭ 46 (-79.82%)
Slic cudaSuperpixel SLIC for GPU (CUDA)
Stars: ✭ 45 (-80.26%)
opencv-cuda-dockerDockerfiles for OpenCV compiled with CUDA, opencv_contrib modules and Python 3 bindings
Stars: ✭ 55 (-75.88%)
TimemoryModular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template API is essentially a framework to creating tools: it is designed to provide a unifying interface for recording various performance measurements alongside data logging and interfaces to other tools.
Stars: ✭ 192 (-15.79%)
mbsolveAn open-source solver tool for the Maxwell-Bloch equations.
Stars: ✭ 14 (-93.86%)
Docs PytorchDeep Object Co-Segmentation
Stars: ✭ 43 (-81.14%)
tiny-cuda-nnLightning fast & tiny C++/CUDA neural network framework
Stars: ✭ 908 (+298.25%)
SpocStream Processing with OCaml
Stars: ✭ 115 (-49.56%)
CudaSHA256Simple tool to calculate sha256 on GPU using Cuda
Stars: ✭ 38 (-83.33%)
Qualia2.0Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Stars: ✭ 41 (-82.02%)
revisiting-sepconvan implementation of Revisiting Adaptive Convolutions for Video Frame Interpolation using PyTorch
Stars: ✭ 43 (-81.14%)
Cx db8a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (Bert, Universal Sentence Encoder, Flair)
Stars: ✭ 164 (-28.07%)
lbvhan implementation of parallel linear BVH (LBVH) on GPU
Stars: ✭ 67 (-70.61%)
Octree SlamLarge octree map construction and rendering with CUDA and OpenGL
Stars: ✭ 40 (-82.46%)
disptoolsGenerate displacement fields with known volume changes
Stars: ✭ 17 (-92.54%)
CltuneCLTune: An automatic OpenCL & CUDA kernel tuner
Stars: ✭ 114 (-50%)
MatXAn efficient C++17 GPU numerical computing library with Python-like syntax
Stars: ✭ 418 (+83.33%)
Style Feature Reshufflecaffe implementation of "Arbitrary Style Transfer with Deep Feature Reshuffle"
Stars: ✭ 38 (-83.33%)
cressetTemplate repository to build PyTorch projects from source on any version of PyTorch/CUDA/cuDNN.
Stars: ✭ 573 (+151.32%)
HasteHaste: a fast, simple, and open RNN library
Stars: ✭ 214 (-6.14%)
PbfVsImplementation of Macklin, Miles, and Matthias Müller. "Position based fluids.". Visual Studio 2015 + CUDA 8.0
Stars: ✭ 100 (-56.14%)
Smallpt Parallel Bvh GpuA GPU implementation of smallpt (http://www.kevinbeason.com/smallpt/) with Bounding Volume Hierarchy (BVH) tree.
Stars: ✭ 36 (-84.21%)
Tensorflow Object Detection TutorialThe purpose of this tutorial is to learn how to install and prepare TensorFlow framework to train your own convolutional neural network object detection classifier for multiple objects, starting from scratch
Stars: ✭ 113 (-50.44%)
cuda-toolkitGitHub Action to install CUDA
Stars: ✭ 34 (-85.09%)
Cure Stars: ✭ 36 (-84.21%)
ClothTOPGPU-accelerated Cloth TOP node for TouchDesigner using the NVIDIA Flex physics solver.
Stars: ✭ 33 (-85.53%)
ClojurecudaClojure library for CUDA development
Stars: ✭ 158 (-30.7%)
KttKernel Tuning Toolkit
Stars: ✭ 33 (-85.53%)
CupochRobotics with GPU computing
Stars: ✭ 225 (-1.32%)
DeepspeechDeepSpeech neon implementation
Stars: ✭ 223 (-2.19%)
NicehashquickminerSuper simple & easy Windows 10 cryptocurrency miner made by NiceHash.
Stars: ✭ 211 (-7.46%)
OneflowOneFlow is a performance-centered and open-source deep learning framework.
Stars: ✭ 2,868 (+1157.89%)
Cuda programmingCode from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch
Stars: ✭ 169 (-25.88%)
Ctranslate2Fast inference engine for OpenNMT models
Stars: ✭ 140 (-38.6%)
Cudart.jlJulia wrapper for CUDA runtime API
Stars: ✭ 75 (-67.11%)