Deep DiamondA fast Clojure Tensor & Deep Learning library
Stars: ✭ 288 (+269.23%)
Open3dOpen3D: A Modern Library for 3D Data Processing
Stars: ✭ 5,860 (+7412.82%)
CudfcuDF - GPU DataFrame Library
Stars: ✭ 4,370 (+5502.56%)
LuisaRenderHigh-Performance Multiple-Backend Renderer Based on LuisaCompute
Stars: ✭ 47 (-39.74%)
GgnnGGNN: State of the Art Graph-based GPU Nearest Neighbor Search
Stars: ✭ 63 (-19.23%)
HemiSimple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.
Stars: ✭ 275 (+252.56%)
ParenchymaAn extensible HPC framework for CUDA, OpenCL and native CPU.
Stars: ✭ 71 (-8.97%)
Cuda Api WrappersThin C++-flavored wrappers for the CUDA Runtime API
Stars: ✭ 362 (+364.1%)
CudasiftA CUDA implementation of SIFT for NVidia GPUs (1.2 ms on a GTX 1060)
Stars: ✭ 555 (+611.54%)
MarianFast Neural Machine Translation in C++
Stars: ✭ 777 (+896.15%)
PycudaCUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+1325.64%)
CubCooperative primitives for CUDA C++.
Stars: ✭ 883 (+1032.05%)
hipaccA domain-specific language and compiler for image processing
Stars: ✭ 72 (-7.69%)
GprmaxgprMax is open source software that simulates electromagnetic wave propagation using the Finite-Difference Time-Domain (FDTD) method for numerical modelling of Ground Penetrating Radar (GPR)
Stars: ✭ 268 (+243.59%)
gpu-monitorScript to remotely check GPU servers for free GPUs
Stars: ✭ 85 (+8.97%)
ArboretumGradient Boosting powered by GPU(NVIDIA CUDA)
Stars: ✭ 64 (-17.95%)
BayaderaHigh-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (+338.46%)
HipsyclImplementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+383.33%)
KomputationKomputation is a neural network framework for the Java Virtual Machine written in Kotlin and CUDA C.
Stars: ✭ 295 (+278.21%)
Stdgpustdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+580.77%)
Lighthouse2Lighthouse 2 framework for real-time ray tracing
Stars: ✭ 542 (+594.87%)
GunrockHigh-Performance Graph Primitives on GPUs
Stars: ✭ 718 (+820.51%)
ThundergbmThunderGBM: Fast GBDTs and Random Forests on GPUs
Stars: ✭ 586 (+651.28%)
Carlsim3CARLsim is an efficient, easy-to-use, GPU-accelerated software framework for simulating large-scale spiking neural network (SNN) models with a high degree of biological detail.
Stars: ✭ 52 (-33.33%)
NeanderthalFast Clojure Matrix Library
Stars: ✭ 927 (+1088.46%)
tiny-cuda-nnLightning fast & tiny C++/CUDA neural network framework
Stars: ✭ 908 (+1064.1%)
Nvidia libs testTests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-53.85%)
QPT[内测中]前向式Python环境快捷封装工具,快速将Python打包为EXE并添加CUDA、NoAVX等支持。
Stars: ✭ 308 (+294.87%)
opencv-cuda-dockerDockerfiles for OpenCV compiled with CUDA, opencv_contrib modules and Python 3 bindings
Stars: ✭ 55 (-29.49%)
lbvhan implementation of parallel linear BVH (LBVH) on GPU
Stars: ✭ 67 (-14.1%)
PopsiftPopSift is an implementation of the SIFT algorithm in CUDA.
Stars: ✭ 259 (+232.05%)
Awesome CudaThis is a list of useful libraries and resources for CUDA development.
Stars: ✭ 274 (+251.28%)
MatXAn efficient C++17 GPU numerical computing library with Python-like syntax
Stars: ✭ 418 (+435.9%)
ArrayfireArrayFire: a general purpose GPU library.
Stars: ✭ 3,693 (+4634.62%)
ThrustThe C++ parallel algorithms library.
Stars: ✭ 3,595 (+4508.97%)
Arrayfire PythonPython bindings for ArrayFire: A general purpose GPU library.
Stars: ✭ 358 (+358.97%)
Fast gicpA collection of GICP-based fast point cloud registration algorithms
Stars: ✭ 307 (+293.59%)
IlgpuILGPU JIT Compiler for high-performance .Net GPU programs
Stars: ✭ 374 (+379.49%)
Cuda.jlCUDA programming in Julia.
Stars: ✭ 370 (+374.36%)
H2o4gpuH2Oai GPU Edition
Stars: ✭ 416 (+433.33%)
warpcontinuous energy monte carlo neutron transport in general geometries on GPUs
Stars: ✭ 27 (-65.38%)
HeteroflowConcurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-26.92%)
CupyNumPy & SciPy for GPU
Stars: ✭ 5,625 (+7111.54%)
RustacudaRusty wrapper for the CUDA Driver API
Stars: ✭ 511 (+555.13%)
ChainerA flexible framework of neural networks for deep learning
Stars: ✭ 5,656 (+7151.28%)
SpeedtorchLibrary for faster pinned CPU <-> GPU transfer in Pytorch
Stars: ✭ 615 (+688.46%)
Cudart.jlJulia wrapper for CUDA runtime API
Stars: ✭ 75 (-3.85%)
BitcrackerBitCracker is the first open source password cracking tool for memory units encrypted with BitLocker
Stars: ✭ 463 (+493.59%)
Tsne CudaGPU Accelerated t-SNE for CUDA with Python bindings
Stars: ✭ 1,120 (+1335.9%)
WheelsPerformance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+1042.31%)
GraphviteGraphVite: A General and High-performance Graph Embedding System
Stars: ✭ 865 (+1008.97%)
Scikit CudaPython interface to GPU-powered libraries
Stars: ✭ 803 (+929.49%)
bazel.cmakebazel.cmake mimics the behavior of bazel to simplify the usability of CMake
Stars: ✭ 38 (-51.28%)
PbfVsImplementation of Macklin, Miles, and Matthias Müller. "Position based fluids.". Visual Studio 2015 + CUDA 8.0
Stars: ✭ 100 (+28.21%)
CaerHigh-performance Vision library in Python. Scale your research, not boilerplate.
Stars: ✭ 452 (+479.49%)
PyopenclOpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+912.82%)
CudaExperiments with CUDA and Rust
Stars: ✭ 31 (-60.26%)
Qualia2.0Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Stars: ✭ 41 (-47.44%)