RemoterySingle C file, Realtime CPU/GPU Profiler with Remote Web Viewer
Stars: ✭ 1,908 (+1056.36%)
Xmrminer🐜 A CUDA based miner for Monero
Stars: ✭ 158 (-4.24%)
CubCooperative primitives for CUDA C++.
Stars: ✭ 883 (+435.15%)
KhivaAn open-source library of algorithms to analyse time series in GPU and CPU.
Stars: ✭ 161 (-2.42%)
CupyNumPy & SciPy for GPU
Stars: ✭ 5,625 (+3309.09%)
Scikit CudaPython interface to GPU-powered libraries
Stars: ✭ 803 (+386.67%)
WheelsPerformance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+440%)
ArboretumGradient Boosting powered by GPU(NVIDIA CUDA)
Stars: ✭ 64 (-61.21%)
BitcrackerBitCracker is the first open source password cracking tool for memory units encrypted with BitLocker
Stars: ✭ 463 (+180.61%)
OnemkloneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (-26.06%)
RustacudaRusty wrapper for the CUDA Driver API
Stars: ✭ 511 (+209.7%)
Cumf alsCUDA Matrix Factorization Library with Alternating Least Square (ALS)
Stars: ✭ 154 (-6.67%)
MarianFast Neural Machine Translation in C++
Stars: ✭ 777 (+370.91%)
PyopenclOpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+378.79%)
ForwardA library for high performance deep learning inference on NVIDIA GPUs.
Stars: ✭ 136 (-17.58%)
ThundergbmThunderGBM: Fast GBDTs and Random Forests on GPUs
Stars: ✭ 586 (+255.15%)
PycudaCUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+573.94%)
Tsne CudaGPU Accelerated t-SNE for CUDA with Python bindings
Stars: ✭ 1,120 (+578.79%)
MprReference implementation for "Massively Parallel Rendering of Complex Closed-Form Implicit Surfaces" (SIGGRAPH 2020)
Stars: ✭ 84 (-49.09%)
ParenchymaAn extensible HPC framework for CUDA, OpenCL and native CPU.
Stars: ✭ 71 (-56.97%)
PygraphistryPyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer
Stars: ✭ 1,365 (+727.27%)
Futhark💥💻💥 A data-parallel functional programming language
Stars: ✭ 1,641 (+894.55%)
CaerHigh-performance Vision library in Python. Scale your research, not boilerplate.
Stars: ✭ 452 (+173.94%)
LibcudacxxThe C++ Standard Library for your entire system.
Stars: ✭ 1,861 (+1027.88%)
Open3dOpen3D: A Modern Library for 3D Data Processing
Stars: ✭ 5,860 (+3451.52%)
Optical Flow FilterA real time optical flow algorithm implemented on GPU
Stars: ✭ 146 (-11.52%)
H2o4gpuH2Oai GPU Edition
Stars: ✭ 416 (+152.12%)
Lighthouse2Lighthouse 2 framework for real-time ray tracing
Stars: ✭ 542 (+228.48%)
Stdgpustdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+221.82%)
CudasiftA CUDA implementation of SIFT for NVidia GPUs (1.2 ms on a GTX 1060)
Stars: ✭ 555 (+236.36%)
CudfcuDF - GPU DataFrame Library
Stars: ✭ 4,370 (+2548.48%)
GunrockHigh-Performance Graph Primitives on GPUs
Stars: ✭ 718 (+335.15%)
ChainerA flexible framework of neural networks for deep learning
Stars: ✭ 5,656 (+3327.88%)
Hoomd BlueMolecular dynamics and Monte Carlo soft matter simulation on GPUs.
Stars: ✭ 143 (-13.33%)
SpeedtorchLibrary for faster pinned CPU <-> GPU transfer in Pytorch
Stars: ✭ 615 (+272.73%)
GraphviteGraphVite: A General and High-performance Graph Embedding System
Stars: ✭ 865 (+424.24%)
NeanderthalFast Clojure Matrix Library
Stars: ✭ 927 (+461.82%)
CudaExperiments with CUDA and Rust
Stars: ✭ 31 (-81.21%)
HipsyclImplementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+128.48%)
HeteroflowConcurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-65.45%)
GgnnGGNN: State of the Art Graph-based GPU Nearest Neighbor Search
Stars: ✭ 63 (-61.82%)
Carlsim3CARLsim is an efficient, easy-to-use, GPU-accelerated software framework for simulating large-scale spiking neural network (SNN) models with a high degree of biological detail.
Stars: ✭ 52 (-68.48%)
Cuda Design PatternsSome CUDA design patterns and a bit of template magic for CUDA
Stars: ✭ 78 (-52.73%)
Cudart.jlJulia wrapper for CUDA runtime API
Stars: ✭ 75 (-54.55%)
ThundersvmThunderSVM: A Fast SVM Library on GPUs and CPUs
Stars: ✭ 1,282 (+676.97%)
Qualia2.0Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Stars: ✭ 41 (-75.15%)
DeepnetDeep.Net machine learning framework for F#
Stars: ✭ 99 (-40%)
PynvvlA Python wrapper of NVIDIA Video Loader (NVVL) with CuPy for fast video loading with Python
Stars: ✭ 95 (-42.42%)
Tensorflow Object Detection TutorialThe purpose of this tutorial is to learn how to install and prepare TensorFlow framework to train your own convolutional neural network object detection classifier for multiple objects, starting from scratch
Stars: ✭ 113 (-31.52%)
NumerNumeric Erlang - vector and matrix operations with CUDA. Heavily inspired by Pteracuda - https://github.com/kevsmith/pteracuda
Stars: ✭ 91 (-44.85%)
Cuda.jlCUDA programming in Julia.
Stars: ✭ 370 (+124.24%)
IlgpuILGPU JIT Compiler for high-performance .Net GPU programs
Stars: ✭ 374 (+126.67%)
Nvidia libs testTests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-78.18%)
Deeppipe2Deep Learning library using GPU(CUDA/cuBLAS)
Stars: ✭ 90 (-45.45%)
MixbenchA GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL)
Stars: ✭ 130 (-21.21%)
PrimitivA Neural Network Toolkit.
Stars: ✭ 164 (-0.61%)