Stdgpustdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+168.18%)
Cuda Design PatternsSome CUDA design patterns and a bit of template magic for CUDA
Stars: ✭ 78 (-60.61%)
CubCooperative primitives for CUDA C++.
Stars: ✭ 883 (+345.96%)
PyopenclOpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+298.99%)
NeanderthalFast Clojure Matrix Library
Stars: ✭ 927 (+368.18%)
Tsne CudaGPU Accelerated t-SNE for CUDA with Python bindings
Stars: ✭ 1,120 (+465.66%)
GgnnGGNN: State of the Art Graph-based GPU Nearest Neighbor Search
Stars: ✭ 63 (-68.18%)
RemoterySingle C file, Realtime CPU/GPU Profiler with Remote Web Viewer
Stars: ✭ 1,908 (+863.64%)
OnemkloneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (-38.38%)
CumlcuML - RAPIDS Machine Learning Library
Stars: ✭ 2,504 (+1164.65%)
ChainerA flexible framework of neural networks for deep learning
Stars: ✭ 5,656 (+2756.57%)
GunrockHigh-Performance Graph Primitives on GPUs
Stars: ✭ 718 (+262.63%)
Scikit CudaPython interface to GPU-powered libraries
Stars: ✭ 803 (+305.56%)
HeteroflowConcurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-71.21%)
Cudart.jlJulia wrapper for CUDA runtime API
Stars: ✭ 75 (-62.12%)
CudaExperiments with CUDA and Rust
Stars: ✭ 31 (-84.34%)
PygraphistryPyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer
Stars: ✭ 1,365 (+589.39%)
Futhark💥💻💥 A data-parallel functional programming language
Stars: ✭ 1,641 (+728.79%)
Hoomd BlueMolecular dynamics and Monte Carlo soft matter simulation on GPUs.
Stars: ✭ 143 (-27.78%)
MixbenchA GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL)
Stars: ✭ 130 (-34.34%)
JcudaJCuda - Java bindings for CUDA
Stars: ✭ 165 (-16.67%)
PrimitivA Neural Network Toolkit.
Stars: ✭ 164 (-17.17%)
Ssd Gpu DmaBuild userspace NVMe drivers and storage applications with CUDA support
Stars: ✭ 172 (-13.13%)
SpeedtorchLibrary for faster pinned CPU <-> GPU transfer in Pytorch
Stars: ✭ 615 (+210.61%)
ThundergbmThunderGBM: Fast GBDTs and Random Forests on GPUs
Stars: ✭ 586 (+195.96%)
MarianFast Neural Machine Translation in C++
Stars: ✭ 777 (+292.42%)
CudasiftA CUDA implementation of SIFT for NVidia GPUs (1.2 ms on a GTX 1060)
Stars: ✭ 555 (+180.3%)
Nvidia DockerBuild and run Docker containers leveraging NVIDIA GPUs
Stars: ✭ 13,961 (+6951.01%)
WheelsPerformance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+350%)
GraphviteGraphVite: A General and High-performance Graph Embedding System
Stars: ✭ 865 (+336.87%)
CupyNumPy & SciPy for GPU
Stars: ✭ 5,625 (+2740.91%)
Carlsim3CARLsim is an efficient, easy-to-use, GPU-accelerated software framework for simulating large-scale spiking neural network (SNN) models with a high degree of biological detail.
Stars: ✭ 52 (-73.74%)
Qualia2.0Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Stars: ✭ 41 (-79.29%)
PycudaCUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+461.62%)
Nvidia libs testTests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-81.82%)
ParenchymaAn extensible HPC framework for CUDA, OpenCL and native CPU.
Stars: ✭ 71 (-64.14%)
ArboretumGradient Boosting powered by GPU(NVIDIA CUDA)
Stars: ✭ 64 (-67.68%)
MprReference implementation for "Massively Parallel Rendering of Complex Closed-Form Implicit Surfaces" (SIGGRAPH 2020)
Stars: ✭ 84 (-57.58%)
Lighthouse2Lighthouse 2 framework for real-time ray tracing
Stars: ✭ 542 (+173.74%)
DeepnetDeep.Net machine learning framework for F#
Stars: ✭ 99 (-50%)
PynvvlA Python wrapper of NVIDIA Video Loader (NVVL) with CuPy for fast video loading with Python
Stars: ✭ 95 (-52.02%)
Tensorflow Object Detection TutorialThe purpose of this tutorial is to learn how to install and prepare TensorFlow framework to train your own convolutional neural network object detection classifier for multiple objects, starting from scratch
Stars: ✭ 113 (-42.93%)
NumerNumeric Erlang - vector and matrix operations with CUDA. Heavily inspired by Pteracuda - https://github.com/kevsmith/pteracuda
Stars: ✭ 91 (-54.04%)
ForwardA library for high performance deep learning inference on NVIDIA GPUs.
Stars: ✭ 136 (-31.31%)
LibcudacxxThe C++ Standard Library for your entire system.
Stars: ✭ 1,861 (+839.9%)
Optical Flow FilterA real time optical flow algorithm implemented on GPU
Stars: ✭ 146 (-26.26%)
Deeppipe2Deep Learning library using GPU(CUDA/cuBLAS)
Stars: ✭ 90 (-54.55%)
KhivaAn open-source library of algorithms to analyse time series in GPU and CPU.
Stars: ✭ 161 (-18.69%)
Xmrminer🐜 A CUDA based miner for Monero
Stars: ✭ 158 (-20.2%)
QudaQUDA is a library for performing calculations in lattice QCD on GPUs.
Stars: ✭ 166 (-16.16%)
Gmonitorgmonitor is a GPU monitor (Nvidia only at the moment)
Stars: ✭ 169 (-14.65%)
ThundersvmThunderSVM: A Fast SVM Library on GPUs and CPUs
Stars: ✭ 1,282 (+547.47%)
Cumf alsCUDA Matrix Factorization Library with Alternating Least Square (ALS)
Stars: ✭ 154 (-22.22%)
CreepminerBurstcoin C++ CPU and GPU Miner
Stars: ✭ 169 (-14.65%)
Macos Egpu Cuda GuideSet up CUDA for machine learning (and gaming) on macOS using a NVIDIA eGPU
Stars: ✭ 187 (-5.56%)