Wheels
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+418.02%)
Ggnn
GGNN: State of the Art Graph-based GPU Nearest Neighbor Search
Stars: ✭ 63 (-63.37%)
Pycuda
CUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+546.51%)
Creepminer
Burstcoin C++ CPU and GPU Miner
Stars: ✭ 169 (-1.74%)
Futhark
💥💻💥 A data-parallel functional programming language
Stars: ✭ 1,641 (+854.07%)
Pyopencl
OpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+359.3%)
Carlsim3
CARLsim is an efficient, easy-to-use, GPU-accelerated software framework for simulating large-scale spiking neural network (SNN) models with a high degree of biological detail.
Stars: ✭ 52 (-69.77%)
Heteroflow
Concurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-66.86%)
Primitiv
A Neural Network Toolkit.
Stars: ✭ 164 (-4.65%)
Pygraphistry
PyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer
Stars: ✭ 1,365 (+693.6%)
Xmrminer
🐜 A CUDA-based miner for Monero
Stars: ✭ 158 (-8.14%)
Remotery
Single C file, real-time CPU/GPU profiler with remote web viewer
Stars: ✭ 1,908 (+1009.3%)
Thundergbm
ThunderGBM: Fast GBDTs and Random Forests on GPUs
Stars: ✭ 586 (+240.7%)
Chainer
A flexible framework of neural networks for deep learning
Stars: ✭ 5,656 (+3188.37%)
Cuda
Experiments with CUDA and Rust
Stars: ✭ 31 (-81.98%)
Nvidia libs test
Tests and benchmarks for cuDNN (and in the future, other NVIDIA libraries)
Stars: ✭ 36 (-79.07%)
Jcuda
JCuda - Java bindings for CUDA
Stars: ✭ 165 (-4.07%)
Node Ntfs
Windows NT File System (NTFS) driver
Stars: ✭ 18 (-89.53%)
Mpr
Reference implementation for "Massively Parallel Rendering of Complex Closed-Form Implicit Surfaces" (SIGGRAPH 2020)
Stars: ✭ 84 (-51.16%)
Deepnet
Deep.Net machine learning framework for F#
Stars: ✭ 99 (-42.44%)
Deeppipe2
Deep Learning library using GPU (CUDA/cuBLAS)
Stars: ✭ 90 (-47.67%)
Libcudacxx
The C++ Standard Library for your entire system.
Stars: ✭ 1,861 (+981.98%)
Forward
A library for high-performance deep learning inference on NVIDIA GPUs.
Stars: ✭ 136 (-20.93%)
Quda
QUDA is a library for performing calculations in lattice QCD on GPUs.
Stars: ✭ 166 (-3.49%)
Khiva
An open-source library of algorithms to analyse time series on GPU and CPU.
Stars: ✭ 161 (-6.4%)
Cudasift
A CUDA implementation of SIFT for NVIDIA GPUs (1.2 ms on a GTX 1060)
Stars: ✭ 555 (+222.67%)
Speedtorch
Library for faster pinned CPU <-> GPU transfer in PyTorch
Stars: ✭ 615 (+257.56%)
Cupy
NumPy & SciPy for GPU
Stars: ✭ 5,625 (+3170.35%)
Marian
Fast Neural Machine Translation in C++
Stars: ✭ 777 (+351.74%)
Gunrock
High-Performance Graph Primitives on GPUs
Stars: ✭ 718 (+317.44%)
Scikit Cuda
Python interface to GPU-powered libraries
Stars: ✭ 803 (+366.86%)
Lighthouse2
Lighthouse 2 framework for real-time ray tracing
Stars: ✭ 542 (+215.12%)
Cub
Cooperative primitives for CUDA C++.
Stars: ✭ 883 (+413.37%)
Graphvite
GraphVite: A General and High-performance Graph Embedding System
Stars: ✭ 865 (+402.91%)
Qualia2.0
Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Stars: ✭ 41 (-76.16%)
Neanderthal
Fast Clojure Matrix Library
Stars: ✭ 927 (+438.95%)
Cumf als
CUDA Matrix Factorization Library with Alternating Least Squares (ALS)
Stars: ✭ 154 (-10.47%)
Tsne Cuda
GPU-accelerated t-SNE for CUDA with Python bindings
Stars: ✭ 1,120 (+551.16%)
Stdgpu
stdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+208.72%)
Cuda Design Patterns
Some CUDA design patterns and a bit of template magic for CUDA
Stars: ✭ 78 (-54.65%)
Cudart.jl
Julia wrapper for the CUDA runtime API
Stars: ✭ 75 (-56.4%)
Thundersvm
ThunderSVM: A Fast SVM Library on GPUs and CPUs
Stars: ✭ 1,282 (+645.35%)
Parenchyma
An extensible HPC framework for CUDA, OpenCL and native CPU.
Stars: ✭ 71 (-58.72%)
Pynvvl
A Python wrapper of NVIDIA Video Loader (NVVL) with CuPy for fast video loading with Python
Stars: ✭ 95 (-44.77%)
Numer
Numeric Erlang - vector and matrix operations with CUDA. Heavily inspired by Pteracuda - https://github.com/kevsmith/pteracuda
Stars: ✭ 91 (-47.09%)
Kdiskmark
A simple open-source disk benchmark tool for Linux distros
Stars: ✭ 152 (-11.63%)
Rpi Vk Driver
Vulkan (VK) driver for the Raspberry Pi (Broadcom VideoCore IV)
Stars: ✭ 1,160 (+574.42%)
Mixbench
A GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL)
Stars: ✭ 130 (-24.42%)
Crossplatformdisktest
Windows, macOS and Android storage (HDD, SSD, RAM) speed testing/performance benchmarking app
Stars: ✭ 123 (-28.49%)
Hoomd Blue
Molecular dynamics and Monte Carlo soft matter simulation on GPUs.
Stars: ✭ 143 (-16.86%)
Onemkl
oneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (-29.07%)
Rustacuda
Rusty wrapper for the CUDA Driver API
Stars: ✭ 511 (+197.09%)
Arboretum
Gradient Boosting powered by GPU (NVIDIA CUDA)
Stars: ✭ 64 (-62.79%)
Tensorflow Object Detection Tutorial
The purpose of this tutorial is to learn how to install and prepare the TensorFlow framework to train your own convolutional neural network object detection classifier for multiple objects, starting from scratch
Stars: ✭ 113 (-34.3%)
Optical Flow Filter
A real-time optical flow algorithm implemented on GPU
Stars: ✭ 146 (-15.12%)
Gmonitor
gmonitor is a GPU monitor (NVIDIA only at the moment)
Stars: ✭ 169 (-1.74%)