Data visualisation in Python based on the OptiX 7.2 ray tracing framework.
A GPU-powered real-time analytics storage and query engine.
A Deep Learning Amazon Web Services (AWS) AMI that is open, free and works. Run in less than 5 minutes. Includes TensorFlow, Keras, PyTorch, Theano, MXNet, CNTK, Caffe and all dependencies.
My fork of Alex Krizhevsky's cuda-convnet from 2013 where I added dropout, among other features.
package cu provides an idiomatic interface to the CUDA Driver API.
JIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal
A minimal cuDNN deep learning training code sample using LeNet.
Simple physically based path tracer based on NVIDIA's OptiX Ray Tracing Engine
CUDA-accelerated GIS and spatiotemporal algorithms
Tengine is a lite, high-performance, modular inference engine for embedded devices
a reimplementation of Holistically-Nested Edge Detection in PyTorch
Robotics with GPU computing
Nvidia Modded Inf
Modified nVidia .inf files to run drivers on all video cards; research and telemetry-free drivers
an implementation of softmax splatting for differentiable forward warping using PyTorch
Image-processing software for cryo-electron microscopy
TIGRE: Tomographic Iterative GPU-based Reconstruction Toolbox
SDK for GPU accelerated genome assembly and analysis
Haste: a fast, simple, and open RNN library
Automatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
HIP: C++ Heterogeneous-Compute Interface for Portability
Distributed multigrid linear solver library on GPU
🌲 Aimbot powered by real-time object detection with neural networks, GPU accelerated with Nvidia. Optimized for use with CS:GO.
Self-hosted NVR with object detection
Modular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template API is essentially a framework for creating tools: it is designed to provide a unifying interface for recording various performance measurements alongside data logging and interfaces to other tools.
Collective Knowledge workflow for Caffe to automate installation across diverse platforms and to collaboratively evaluate and optimize Caffe-based workloads across diverse hardware, software and data sets (compilers, libraries, tools, models, inputs).
a reimplementation of Optical Flow Estimation using a Spatial Pyramid Network in PyTorch
Build and run Docker containers leveraging NVIDIA GPUs
cuML - RAPIDS Machine Learning Library
Ssd Gpu Dma
Build userspace NVMe drivers and storage applications with CUDA support
gmonitor is a GPU monitor (Nvidia only at the moment)
Code from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch
Dragon: A Computation Graph Virtual Machine Based Deep Learning Framework.
A C++ Compute/Graphics Library and Toolchain enabling same-source CUDA/Host/Metal/OpenCL/Vulkan C++ programming and execution.
QUDA is a library for performing calculations in lattice QCD on GPUs.
Sparse Optimisation Research Code
JCuda - Java bindings for CUDA
a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (BERT, Universal Sentence Encoder, Flair)
An open-source library of algorithms to analyse time series on GPU and CPU.
CUDA Matrix Factorization Library with Alternating Least Squares (ALS)
Domain-invariant Stereo Matching Networks