Deformable KernelsDeforming kernels to adapt towards object deformation. In ICLR 2020.
Stars: ✭ 166 (-29.06%)
PrimitivA Neural Network Toolkit.
Stars: ✭ 164 (-29.91%)
SimplegpuhashtableA simple GPU hash table implemented in CUDA using lock free techniques
Stars: ✭ 198 (-15.38%)
CreepminerBurstcoin C++ CPU and GPU Miner
Stars: ✭ 169 (-27.78%)
DsmnetDomain-invariant Stereo Matching Networks
Stars: ✭ 153 (-34.62%)
AmgxDistributed multigrid linear solver library on GPU
Stars: ✭ 207 (-11.54%)
Softmax Splattingan implementation of softmax splatting for differentiable forward warping using PyTorch
Stars: ✭ 218 (-6.84%)
Xmrminer🐜 A CUDA based miner for Monero
Stars: ✭ 158 (-32.48%)
Ck CaffeCollective Knowledge workflow for Caffe to automate installation across diverse platforms and to collaboratively evaluate and optimize Caffe-based workloads across diverse hardware, software and data sets (compilers, libraries, tools, models, inputs):
Stars: ✭ 192 (-17.95%)
Ssd Gpu DmaBuild userspace NVMe drivers and storage applications with CUDA support
Stars: ✭ 172 (-26.5%)
GinkgoNumerical linear algebra software package
Stars: ✭ 149 (-36.32%)
BohriumAutomatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Stars: ✭ 209 (-10.68%)
Cuda programmingCode from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch
Stars: ✭ 169 (-27.78%)
DeepspeechDeepSpeech neon implementation
Stars: ✭ 223 (-4.7%)
QudaQUDA is a library for performing calculations in lattice QCD on GPUs.
Stars: ✭ 166 (-29.06%)
OneflowOneFlow is a performance-centered and open-source deep learning framework.
Stars: ✭ 2,868 (+1125.64%)
TengineTengine is a lite, high performance, modular inference engine for embedded device
Stars: ✭ 4,012 (+1614.53%)
KhivaAn open-source library of algorithms to analyse time series in GPU and CPU.
Stars: ✭ 161 (-31.2%)
ViseronSelf-hosted NVR with object detection
Stars: ✭ 192 (-17.95%)
RmmRAPIDS Memory Manager
Stars: ✭ 154 (-34.19%)
NicehashquickminerSuper simple & easy Windows 10 cryptocurrency miner made by NiceHash.
Stars: ✭ 211 (-9.83%)
JetsonHelmut Hoffer von Ankershoffen experimenting with arm64 based NVIDIA Jetson (Nano and AGX Xavier) edge devices running Kubernetes (K8s) for machine learning (ML) including Jupyter Notebooks, TensorFlow Training and TensorFlow Serving using CUDA for smart IoT.
Stars: ✭ 151 (-35.47%)
Macos Egpu Cuda GuideSet up CUDA for machine learning (and gaming) on macOS using a NVIDIA eGPU
Stars: ✭ 187 (-20.09%)
CumlcuML - RAPIDS Machine Learning Library
Stars: ✭ 2,504 (+970.09%)
Cuda CnnCNN accelerated by cuda. Test on mnist and finilly get 99.76%
Stars: ✭ 148 (-36.75%)
HasteHaste: a fast, simple, and open RNN library
Stars: ✭ 214 (-8.55%)
Gmonitorgmonitor is a GPU monitor (Nvidia only at the moment)
Stars: ✭ 169 (-27.78%)
Nvidia Modded InfModified nVidia .inf files to run drivers on all video cards, research & telemetry free drivers
Stars: ✭ 227 (-2.99%)
HipHIP: C++ Heterogeneous-Compute Interface for Portability
Stars: ✭ 2,609 (+1014.96%)
DragonDragon: A Computation Graph Virtual Machine Based Deep Learning Framework.
Stars: ✭ 168 (-28.21%)
CuspatialCUDA-accelerated GIS and spatiotemporal algorithms
Stars: ✭ 229 (-2.14%)
FloorA C++ Compute/Graphics Library and Toolchain enabling same-source CUDA/Host/Metal/OpenCL/Vulkan C++ programming and execution.
Stars: ✭ 166 (-29.06%)
Cunn Stars: ✭ 205 (-12.39%)
SporcoSparse Optimisation Research Code
Stars: ✭ 164 (-29.91%)
Pedestrian alignmentTCSVT2018 Pedestrian Alignment Network for Large-scale Person Re-identification
Stars: ✭ 223 (-4.7%)
JcudaJCuda - Java bindings for CUDA
Stars: ✭ 165 (-29.49%)
Pine🌲 Aimbot powered by real-time object detection with neural networks, GPU accelerated with Nvidia. Optimized for use with CS:GO.
Stars: ✭ 202 (-13.68%)
Multi Gpu Programming ModelsExamples demonstrating available options to program multiple GPUs in a single node or a cluster
Stars: ✭ 165 (-29.49%)
Cudnn TrainingA CUDNN minimal deep learning training code sample using LeNet.
Stars: ✭ 231 (-1.28%)
Cx db8a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (Bert, Universal Sentence Encoder, Flair)
Stars: ✭ 164 (-29.91%)
ClojurecudaClojure library for CUDA development
Stars: ✭ 158 (-32.48%)
RelionImage-processing software for cryo-electron microscopy
Stars: ✭ 219 (-6.41%)
TimemoryModular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template API is essentially a framework to creating tools: it is designed to provide a unifying interface for recording various performance measurements alongside data logging and interfaces to other tools.
Stars: ✭ 192 (-17.95%)
Cumf alsCUDA Matrix Factorization Library with Alternating Least Square (ALS)
Stars: ✭ 154 (-34.19%)
Pytorch Heda reimplementation of Holistically-Nested Edge Detection in PyTorch
Stars: ✭ 228 (-2.56%)
CompactcnncascadeA binary library for very fast face detection using compact CNNs.
Stars: ✭ 152 (-35.04%)
Pytorch Spynet a reimplementation of Optical Flow Estimation using a Spatial Pyramid Network in PyTorch
Stars: ✭ 190 (-18.8%)
TigreTIGRE: Tomographic Iterative GPU-based Reconstruction Toolbox
Stars: ✭ 215 (-8.12%)
Nvidia DockerBuild and run Docker containers leveraging NVIDIA GPUs
Stars: ✭ 13,961 (+5866.24%)
OccaJIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal
Stars: ✭ 230 (-1.71%)
Optix PathtracerSimple physically based path tracer based on Nvidia's Optix Ray Tracing Engine
Stars: ✭ 231 (-1.28%)
CupochRobotics with GPU computing
Stars: ✭ 225 (-3.85%)
GenomeworksSDK for GPU accelerated genome assembly and analysis
Stars: ✭ 215 (-8.12%)