VuhVulkan compute for people
Stars: ✭ 264 (-83.91%)
PopsiftPopSift is an implementation of the SIFT algorithm in CUDA.
Stars: ✭ 259 (-84.22%)
gpu-monitorScript to remotely check GPU servers for free GPUs
Stars: ✭ 85 (-94.82%)
Fast gicpA collection of GICP-based fast point cloud registration algorithms
Stars: ✭ 307 (-81.29%)
WebclglGPGPU Javascript library 🐸
Stars: ✭ 313 (-80.93%)
TornadovmTornadoVM: A practical and efficient heterogeneous programming framework for managed languages
Stars: ✭ 479 (-70.81%)
Xray Oxygen🌀 Oxygen Engine 2.0. [Preview] Discord: https://discord.gg/P3aMf66
Stars: ✭ 481 (-70.69%)
RustacudaRusty wrapper for the CUDA Driver API
Stars: ✭ 511 (-68.86%)
Silk.netThe high-speed OpenAL, OpenGL, Vulkan, and GLFW bindings library your mother warned you about.
Stars: ✭ 534 (-67.46%)
BytecoderRich Domain Model for JVM Bytecode and Framework to interpret and transpile it.
Stars: ✭ 401 (-75.56%)
CupyNumPy & SciPy for GPU
Stars: ✭ 5,625 (+242.78%)
Lighthouse2Lighthouse 2 framework for real-time ray tracing
Stars: ✭ 542 (-66.97%)
CudasiftA CUDA implementation of SIFT for NVidia GPUs (1.2 ms on a GTX 1060)
Stars: ✭ 555 (-66.18%)
crowdsource-video-experiments-on-androidCrowdsourcing video experiments (such as collaborative benchmarking and optimization of DNN algorithms) using Collective Knowledge Framework across diverse Android devices provided by volunteers. Results are continuously aggregated in the open repository:
Stars: ✭ 29 (-98.23%)
UcxUnified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
Stars: ✭ 471 (-71.3%)
Regl CnnDigit recognition with Convolutional Neural Networks in WebGL
Stars: ✭ 490 (-70.14%)
Halidea language for fast, portable data-parallel computation
Stars: ✭ 4,722 (+187.75%)
LuxcoreLuxCore source repository
Stars: ✭ 601 (-63.38%)
SpeedtorchLibrary for faster pinned CPU <-> GPU transfer in Pytorch
Stars: ✭ 615 (-62.52%)
ChainerA flexible framework of neural networks for deep learning
Stars: ✭ 5,656 (+244.67%)
CaerHigh-performance Vision library in Python. Scale your research, not boilerplate.
Stars: ✭ 452 (-72.46%)
Tf CorianderOpenCL 1.2 implementation for Tensorflow
Stars: ✭ 775 (-52.77%)
ThundergbmThunderGBM: Fast GBDTs and Random Forests on GPUs
Stars: ✭ 586 (-64.29%)
Open3dOpen3D: A Modern Library for 3D Data Processing
Stars: ✭ 5,860 (+257.1%)
ClblastTuned OpenCL BLAS
Stars: ✭ 559 (-65.94%)
CorianderBuild NVIDIA® CUDA™ code for OpenCL™ 1.2 devices
Stars: ✭ 665 (-59.48%)
MarianFast Neural Machine Translation in C++
Stars: ✭ 777 (-52.65%)
NumbaNumPy aware dynamic Python compiler using LLVM
Stars: ✭ 7,090 (+332.05%)
TaskflowA General-purpose Parallel and Heterogeneous Task Programming System
Stars: ✭ 6,128 (+273.43%)
GunrockHigh-Performance Graph Primitives on GPUs
Stars: ✭ 718 (-56.25%)
opencv-cuda-dockerDockerfiles for OpenCV compiled with CUDA, opencv_contrib modules and Python 3 bindings
Stars: ✭ 55 (-96.65%)
GpusortingImplementation of a few sorting algorithms in OpenCL
Stars: ✭ 9 (-99.45%)
GraphviteGraphVite: A General and High-performance Graph Embedding System
Stars: ✭ 865 (-47.29%)
MetalpetalA GPU accelerated image and video processing framework built on Metal.
Stars: ✭ 907 (-44.73%)
WheelsPerformance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (-45.7%)
CubCooperative primitives for CUDA C++.
Stars: ✭ 883 (-46.19%)
JuliaThe Julia Programming Language
Stars: ✭ 37,497 (+2185.01%)
Nvidia libs testTests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-97.81%)
ComputesharpA .NET 5 library to run C# code in parallel on the GPU through DX12 and dynamically generated HLSL compute shaders, with the goal of making GPU computing easy to use for all .NET developers! 🚀
Stars: ✭ 982 (-40.16%)
Soul EnginePhysically based renderer and simulation engine for real-time applications.
Stars: ✭ 37 (-97.75%)
Scikit CudaPython interface to GPU-powered libraries
Stars: ✭ 803 (-51.07%)
CudaExperiments with CUDA and Rust
Stars: ✭ 31 (-98.11%)
OpenclpapersA Collection of Articles and other OpenCL Papers
Stars: ✭ 37 (-97.75%)
Qualia2.0Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Stars: ✭ 41 (-97.5%)
GeopmGlobal Extensible Open Power Manager
Stars: ✭ 57 (-96.53%)
HeteroflowConcurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-96.53%)
Clarrays.jlOpenCL-backed GPU Arrays
Stars: ✭ 58 (-96.47%)
PycudaCUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (-32.24%)
Carlsim3CARLsim is an efficient, easy-to-use, GPU-accelerated software framework for simulating large-scale spiking neural network (SNN) models with a high degree of biological detail.
Stars: ✭ 52 (-96.83%)
Tsne CudaGPU Accelerated t-SNE for CUDA with Python bindings
Stars: ✭ 1,120 (-31.75%)
ArboretumGradient Boosting powered by GPU(NVIDIA CUDA)
Stars: ✭ 64 (-96.1%)
VbiosfinderExtract embedded VBIOS from (almost) any BIOS Update
Stars: ✭ 64 (-96.1%)
Autodock GpuAutoDock for GPUs and other accelerators
Stars: ✭ 65 (-96.04%)
GbrainGPU Javascript Library for Machine Learning
Stars: ✭ 48 (-97.07%)
GgnnGGNN: State of the Art Graph-based GPU Nearest Neighbor Search
Stars: ✭ 63 (-96.16%)
Sycl DnnSYCL-DNN is a library implementing neural network algorithms written using SYCL
Stars: ✭ 67 (-95.92%)
Cuda Design PatternsSome CUDA design patterns and a bit of template magic for CUDA
Stars: ✭ 78 (-95.25%)
HiopHPC solver for nonlinear optimization problems
Stars: ✭ 75 (-95.43%)