Cuda programmingCode from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch
Stars: ✭ 169 (-28.99%)
OneflowOneFlow is a performance-centered and open-source deep learning framework.
Stars: ✭ 2,868 (+1105.04%)
Ssd Gpu DmaBuild userspace NVMe drivers and storage applications with CUDA support
Stars: ✭ 172 (-27.73%)
RmmRAPIDS Memory Manager
Stars: ✭ 154 (-35.29%)
BohriumAutomatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Stars: ✭ 209 (-12.18%)
QudaQUDA is a library for performing calculations in lattice QCD on GPUs.
Stars: ✭ 166 (-30.25%)
DeepspeechDeepSpeech neon implementation
Stars: ✭ 223 (-6.3%)
KhivaAn open-source library of algorithms to analyse time series in GPU and CPU.
Stars: ✭ 161 (-32.35%)
ViseronSelf-hosted NVR with object detection
Stars: ✭ 192 (-19.33%)
JetsonHelmut Hoffer von Ankershoffen experimenting with arm64 based NVIDIA Jetson (Nano and AGX Xavier) edge devices running Kubernetes (K8s) for machine learning (ML) including Jupyter Notebooks, TensorFlow Training and TensorFlow Serving using CUDA for smart IoT.
Stars: ✭ 151 (-36.55%)
GenomeworksSDK for GPU accelerated genome assembly and analysis
Stars: ✭ 215 (-9.66%)
CreepminerBurstcoin C++ CPU and GPU Miner
Stars: ✭ 169 (-28.99%)
CupochRobotics with GPU computing
Stars: ✭ 225 (-5.46%)
Deformable KernelsDeforming kernels to adapt towards object deformation. In ICLR 2020.
Stars: ✭ 166 (-30.25%)
AmgxDistributed multigrid linear solver library on GPU
Stars: ✭ 207 (-13.03%)
Optix PathtracerSimple physically based path tracer based on Nvidia's Optix Ray Tracing Engine
Stars: ✭ 231 (-2.94%)
PrimitivA Neural Network Toolkit.
Stars: ✭ 164 (-31.09%)
SimplegpuhashtableA simple GPU hash table implemented in CUDA using lock free techniques
Stars: ✭ 198 (-16.81%)
Xmrminer🐜 A CUDA based miner for Monero
Stars: ✭ 158 (-33.61%)
Softmax Splattingan implementation of softmax splatting for differentiable forward warping using PyTorch
Stars: ✭ 218 (-8.4%)
DsmnetDomain-invariant Stereo Matching Networks
Stars: ✭ 153 (-35.71%)
Ck CaffeCollective Knowledge workflow for Caffe to automate installation across diverse platforms and to collaboratively evaluate and optimize Caffe-based workloads across diverse hardware, software and data sets (compilers, libraries, tools, models, inputs):
Stars: ✭ 192 (-19.33%)
Nvidia DockerBuild and run Docker containers leveraging NVIDIA GPUs
Stars: ✭ 13,961 (+5765.97%)
TigreTIGRE: Tomographic Iterative GPU-based Reconstruction Toolbox
Stars: ✭ 215 (-9.66%)
CumlcuML - RAPIDS Machine Learning Library
Stars: ✭ 2,504 (+952.1%)
Pytorch Heda reimplementation of Holistically-Nested Edge Detection in PyTorch
Stars: ✭ 228 (-4.2%)
Gmonitorgmonitor is a GPU monitor (Nvidia only at the moment)
Stars: ✭ 169 (-28.99%)
HasteHaste: a fast, simple, and open RNN library
Stars: ✭ 214 (-10.08%)
Cudnn TrainingA CUDNN minimal deep learning training code sample using LeNet.
Stars: ✭ 231 (-2.94%)
DragonDragon: A Computation Graph Virtual Machine Based Deep Learning Framework.
Stars: ✭ 168 (-29.41%)
HipHIP: C++ Heterogeneous-Compute Interface for Portability
Stars: ✭ 2,609 (+996.22%)
FloorA C++ Compute/Graphics Library and Toolchain enabling same-source CUDA/Host/Metal/OpenCL/Vulkan C++ programming and execution.
Stars: ✭ 166 (-30.25%)
Nvidia Modded InfModified nVidia .inf files to run drivers on all video cards, research & telemetry free drivers
Stars: ✭ 227 (-4.62%)
SporcoSparse Optimisation Research Code
Stars: ✭ 164 (-31.09%)
Cunn Stars: ✭ 205 (-13.87%)
JcudaJCuda - Java bindings for CUDA
Stars: ✭ 165 (-30.67%)
Cupackage cu provides an idiomatic interface to the CUDA Driver API.
Stars: ✭ 234 (-1.68%)
Multi Gpu Programming ModelsExamples demonstrating available options to program multiple GPUs in a single node or a cluster
Stars: ✭ 165 (-30.67%)
Pine🌲 Aimbot powered by real-time object detection with neural networks, GPU accelerated with Nvidia. Optimized for use with CS:GO.
Stars: ✭ 202 (-15.13%)
Cx db8a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (Bert, Universal Sentence Encoder, Flair)
Stars: ✭ 164 (-31.09%)
Pedestrian alignmentTCSVT2018 Pedestrian Alignment Network for Large-scale Person Re-identification
Stars: ✭ 223 (-6.3%)
ClojurecudaClojure library for CUDA development
Stars: ✭ 158 (-33.61%)
CuspatialCUDA-accelerated GIS and spatiotemporal algorithms
Stars: ✭ 229 (-3.78%)
Cumf alsCUDA Matrix Factorization Library with Alternating Least Square (ALS)
Stars: ✭ 154 (-35.29%)
TimemoryModular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template API is essentially a framework to creating tools: it is designed to provide a unifying interface for recording various performance measurements alongside data logging and interfaces to other tools.
Stars: ✭ 192 (-19.33%)
CompactcnncascadeA binary library for very fast face detection using compact CNNs.
Stars: ✭ 152 (-36.13%)
RelionImage-processing software for cryo-electron microscopy
Stars: ✭ 219 (-7.98%)
Pytorch Spynet a reimplementation of Optical Flow Estimation using a Spatial Pyramid Network in PyTorch
Stars: ✭ 190 (-20.17%)
OccaJIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal
Stars: ✭ 230 (-3.36%)
TengineTengine is a lite, high performance, modular inference engine for embedded device
Stars: ✭ 4,012 (+1585.71%)
NicehashquickminerSuper simple & easy Windows 10 cryptocurrency miner made by NiceHash.
Stars: ✭ 211 (-11.34%)
Macos Egpu Cuda GuideSet up CUDA for machine learning (and gaming) on macOS using a NVIDIA eGPU
Stars: ✭ 187 (-21.43%)