Top 626 gpu open source projects

Cloudml
R interface to Google Cloud Machine Learning Engine
Nx
Multi-dimensional arrays (tensors) and numerical definitions for Elixir
Arboretum
Gradient Boosting powered by GPU(NVIDIA CUDA)
Vbiosfinder
Extract embedded VBIOS from (almost) any BIOS Update
Ggnn
GGNN: State of the Art Graph-based GPU Nearest Neighbor Search
Tsne Cuda
GPU Accelerated t-SNE for CUDA with Python bindings
Aiopen
AIOpen是一个按人工智能三要素(数据、算法、算力)进行AI开源项目分类的汇集项目,项目致力于跟踪目前人工智能(AI)的深度学习(DL)开源项目,并尽可能地罗列目前的开源项目,同时加入了一些曾经研究过的代码。通过这些开源项目,使初次接触AI的人们对人工智能(深度学习)有更清晰和更全面的了解。
Pycuda
CUDA integration for Python, plus shiny features
Memory Efficient Maml
Memory efficient MAML using gradient checkpointing
Rayray
A tiny GPU raytracer, using Zig and WebGPU
Clarrays.jl
OpenCL-backed GPU Arrays
Dain Vulkan Gui
AI-Powered video interpolater (eg. 30fps -> 60fps) for Vulkan devices. Based on dain-ncnn-vulkan and ffmpeg
Gpuview
A lightweight web dashboard for monitoring GPU usage
Heteroflow
Concurrent CPU-GPU Programming using Task Models
Carlsim3
CARLsim is an efficient, easy-to-use, GPU-accelerated software framework for simulating large-scale spiking neural network (SNN) models with a high degree of biological detail.
✭ 52
gpucuda
Gpu Spline Deformation
Baking spline deformation to a texture then applying it to a mesh via a shader.
Multiple Smi
Python bindings for pyNVML and psutil library over network
Qualia2.0
Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Basic pathtracer
A basic GPU pathtracer in unity
Go Transcode
Live on-demand transcoding in go using ffmpeg. Also with NVIDIA GPU hardware acceleration.
Gpudashboard
A simple dashboard for NVIDIA GPU
Computesharp
A .NET 5 library to run C# code in parallel on the GPU through DX12 and dynamically generated HLSL compute shaders, with the goal of making GPU computing easy to use for all .NET developers! 🚀
Nvidia libs test
Tests and benchmarks for cudnn (and in the future, other nvidia libraries)
Saber
Window-Based Hybrid CPU/GPU Stream Processing Engine
Computeraster
Real-time software rasterizer using compute shaders, including vertex processing stage (IA and vertex shaders), bin rasterization, tile rasterization (coarse rasterization), and pixel rasterization (fine rasterization, which calls the pixel shaders).
✭ 32
gpu
Cuda
Experiments with CUDA and Rust
Vkquake
Vulkan Quake port based on QuakeSpasm
Drlkit
A High Level Python Deep Reinforcement Learning library. Great for beginners, prototyping and quickly comparing algorithms
Keras object detection
Convert any classification model or architecture trained in keras to an object detection model
Cdpvideorecord
An video camera,you can have realtime of a beautify,and change camera position,or turn on/off flash.Details see demo.
Alacritty
Alacritty is a modern terminal emulator that comes with sensible defaults, but allows for extensive configuration. By integrating with other applications, rather than reimplementing their functionality, it manages to provide a flexible set of features with high performance. The supported platforms currently consist of BSD, Linux, macOS and Windows.
Autooffload.jl
Automatic GPU, TPU, FPGA, Xeon Phi, Multithreaded, Distributed, etc. offloading for scientific machine learning (SciML) and differential equations
Metalpetal
A GPU accelerated image and video processing framework built on Metal.
Cub
Cooperative primitives for CUDA C++.
Docker Tensorflow Keras Gpu
Run Tensorflow and Keras with GPU support on Kubernetes
Graphvite
GraphVite: A General and High-performance Graph Embedding System
Ksim
The little simulator that could.
Gpusorting
Implementation of a few sorting algorithms in OpenCL
Daskmaskrcnn
Running Mask-RCNN on Dask with PyTorch
Fieldplay
A vector field explorer
Jetsonjs
Embed a JavaScript/WebGL application on a Nvidia Jetson TX2 and stream the results through websockets. It does not rely on CUDA/Jetpack. HDMI touchscreen, virtual keyboard, GPIO control, wifi config are included.
Wheels
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Turbotransformers
a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.
Tensorflow.jl
A Julia wrapper for TensorFlow
Scikit Cuda
Python interface to GPU-powered libraries
Tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Makie.jl
High level plotting on the GPU.
Marian
Fast Neural Machine Translation in C++
Fancontrol.releases
This is the release repository for Fan Control, a highly customizable fan controlling software for Windows.
Tf Coriander
OpenCL 1.2 implementation for Tensorflow