Modulated Deform Convdeformable convolution 2D 3D DeformableConvolution DeformConv Modulated Pytorch CUDA
Stars: ✭ 81 (-64.94%)
Hoomd BlueMolecular dynamics and Monte Carlo soft matter simulation on GPUs.
Stars: ✭ 143 (-38.1%)
HiopHPC solver for nonlinear optimization problems
Stars: ✭ 75 (-67.53%)
Gmonitorgmonitor is a GPU monitor (Nvidia only at the moment)
Stars: ✭ 169 (-26.84%)
TitanA high-performance CUDA-based physics simulation sandbox for soft robotics and reinforcement learning.
Stars: ✭ 73 (-68.4%)
ForwardA library for high performance deep learning inference on NVIDIA GPUs.
Stars: ✭ 136 (-41.13%)
HipHIP: C++ Heterogeneous-Compute Interface for Portability
Stars: ✭ 2,609 (+1029.44%)
Torch samplingEfficient reservoir sampling implementation for PyTorch
Stars: ✭ 68 (-70.56%)
NsimdAgenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (-40.26%)
ArboretumGradient Boosting powered by GPU(NVIDIA CUDA)
Stars: ✭ 64 (-72.29%)
SpanetSpatial Attentive Single-Image Deraining with a High Quality Real Rain Dataset (CVPR'19)
Stars: ✭ 136 (-41.13%)
CudadtwGPU-Suite
Stars: ✭ 63 (-72.73%)
Pedestrian alignmentTCSVT2018 Pedestrian Alignment Network for Large-scale Person Re-identification
Stars: ✭ 223 (-3.46%)
CutlassCUDA Templates for Linear Algebra Subroutines
Stars: ✭ 1,123 (+386.15%)
Tsne CudaGPU Accelerated t-SNE for CUDA with Python bindings
Stars: ✭ 1,120 (+384.85%)
DragonDragon: A Computation Graph Virtual Machine Based Deep Learning Framework.
Stars: ✭ 168 (-27.27%)
MinkowskiengineMinkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors
Stars: ✭ 1,110 (+380.52%)
NnvmNo description or website provided.
Stars: ✭ 1,639 (+609.52%)
MpmSimulating on GPU using Material Point Method and rendering.
Stars: ✭ 61 (-73.59%)
Cunn Stars: ✭ 205 (-11.26%)
MixbenchA GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL)
Stars: ✭ 130 (-43.72%)
FloorA C++ Compute/Graphics Library and Toolchain enabling same-source CUDA/Host/Metal/OpenCL/Vulkan C++ programming and execution.
Stars: ✭ 166 (-28.14%)
Cuda SamplesSamples for CUDA Developers which demonstrates features in CUDA Toolkit
Stars: ✭ 1,087 (+370.56%)
Pytorch Heda reimplementation of Holistically-Nested Edge Detection in PyTorch
Stars: ✭ 228 (-1.3%)
Hzproctorch data augmentation toolbox (supports affine transform)
Stars: ✭ 56 (-75.76%)
Kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+4727.27%)
3d Ken Burnsan implementation of 3D Ken Burns Effect from a Single Image using PyTorch
Stars: ✭ 1,073 (+364.5%)
SporcoSparse Optimisation Research Code
Stars: ✭ 164 (-29%)
PamtriPAMTRI: Pose-Aware Multi-Task Learning for Vehicle Re-Identification (ICCV 2019) - Official PyTorch Implementation
Stars: ✭ 53 (-77.06%)
FcisFully Convolutional Instance-aware Semantic Segmentation
Stars: ✭ 1,563 (+576.62%)
HungariangpuAn GPU/CUDA implementation of the Hungarian algorithm
Stars: ✭ 51 (-77.92%)
Pine🌲 Aimbot powered by real-time object detection with neural networks, GPU accelerated with Nvidia. Optimized for use with CS:GO.
Stars: ✭ 202 (-12.55%)
HornetHornet data structure for sparse dynamic graphs and matrices
Stars: ✭ 49 (-78.79%)
OnemkloneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (-47.19%)
Slic cudaSuperpixel SLIC for GPU (CUDA)
Stars: ✭ 45 (-80.52%)
JcudaJCuda - Java bindings for CUDA
Stars: ✭ 165 (-28.57%)
Docs PytorchDeep Object Co-Segmentation
Stars: ✭ 43 (-81.39%)
Knn cudaFast K-Nearest Neighbor search with GPU
Stars: ✭ 119 (-48.48%)
Qualia2.0Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Stars: ✭ 41 (-82.25%)
RelionImage-processing software for cryo-electron microscopy
Stars: ✭ 219 (-5.19%)
Octree SlamLarge octree map construction and rendering with CUDA and OpenGL
Stars: ✭ 40 (-82.68%)
SpocStream Processing with OCaml
Stars: ✭ 115 (-50.22%)
Style Feature Reshufflecaffe implementation of "Arbitrary Style Transfer with Deep Feature Reshuffle"
Stars: ✭ 38 (-83.55%)
Multi Gpu Programming ModelsExamples demonstrating available options to program multiple GPUs in a single node or a cluster
Stars: ✭ 165 (-28.57%)
CltuneCLTune: An automatic OpenCL & CUDA kernel tuner
Stars: ✭ 114 (-50.65%)
Optix PathtracerSimple physically based path tracer based on Nvidia's Optix Ray Tracing Engine
Stars: ✭ 231 (+0%)
TengineTengine is a lite, high performance, modular inference engine for embedded device
Stars: ✭ 4,012 (+1636.8%)
DeepspeechDeepSpeech neon implementation
Stars: ✭ 223 (-3.46%)
BohriumAutomatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Stars: ✭ 209 (-9.52%)
Ssd Gpu DmaBuild userspace NVMe drivers and storage applications with CUDA support
Stars: ✭ 172 (-25.54%)
RemoterySingle C file, Realtime CPU/GPU Profiler with Remote Web Viewer
Stars: ✭ 1,908 (+725.97%)
Nnabla Ext CudaA CUDA Extension of Neural Network Libraries
Stars: ✭ 79 (-65.8%)