Stdgpustdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+436.36%)
BayaderaHigh-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (+245.45%)
HeteroflowConcurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-42.42%)
CekirdeklerMulti-device OpenCL kernel load balancer and pipeliner API for C#. Uses shared-distributed memory model to keep GPUs updated fast while using same kernel on all devices(for simplicity).
Stars: ✭ 76 (-23.23%)
rbcudaCUDA bindings for Ruby
Stars: ✭ 57 (-42.42%)
MatXAn efficient C++17 GPU numerical computing library with Python-like syntax
Stars: ✭ 418 (+322.22%)
HipsyclImplementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+280.81%)
NeanderthalFast Clojure Matrix Library
Stars: ✭ 927 (+836.36%)
CupyNumPy & SciPy for GPU
Stars: ✭ 5,625 (+5581.82%)
ArraymancerA fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+701.01%)
Nvidia libs testTests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-63.64%)
BohriumAutomatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Stars: ✭ 209 (+111.11%)
PycudaCUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+1023.23%)
cuda memtestFork of CUDA GPU memtest 👓
Stars: ✭ 68 (-31.31%)
VuhVulkan compute for people
Stars: ✭ 264 (+166.67%)
ClojurecudaClojure library for CUDA development
Stars: ✭ 158 (+59.6%)
Cuda Api WrappersThin C++-flavored wrappers for the CUDA Runtime API
Stars: ✭ 362 (+265.66%)
EmuThe write-once-run-anywhere GPGPU library for Rust
Stars: ✭ 1,350 (+1263.64%)
PicongpuParticle-in-Cell Simulations for the Exascale Era ✨
Stars: ✭ 452 (+356.57%)
CaerHigh-performance Vision library in Python. Scale your research, not boilerplate.
Stars: ✭ 452 (+356.57%)
RustacudaRusty wrapper for the CUDA Driver API
Stars: ✭ 511 (+416.16%)
Lighthouse2Lighthouse 2 framework for real-time ray tracing
Stars: ✭ 542 (+447.47%)
CudasiftA CUDA implementation of SIFT for NVidia GPUs (1.2 ms on a GTX 1060)
Stars: ✭ 555 (+460.61%)
NyuziprocessorGPGPU microprocessor architecture
Stars: ✭ 1,351 (+1264.65%)
Deeppipe2Deep Learning library using GPU(CUDA/cuBLAS)
Stars: ✭ 90 (-9.09%)
Open3dOpen3D: A Modern Library for 3D Data Processing
Stars: ✭ 5,860 (+5819.19%)
BitcrackerBitCracker is the first open source password cracking tool for memory units encrypted with BitLocker
Stars: ✭ 463 (+367.68%)
H2o4gpuH2Oai GPU Edition
Stars: ✭ 416 (+320.2%)
GunrockHigh-Performance Graph Primitives on GPUs
Stars: ✭ 718 (+625.25%)
Kubernetes Gpu GuideThis guide should help fellow researchers and hobbyists to easily automate and accelerate there deep leaning training with their own Kubernetes GPU cluster.
Stars: ✭ 740 (+647.47%)
Glove As A Tensorflow Embedding LayerTaking a pretrained GloVe model, and using it as a TensorFlow embedding weight layer **inside the GPU**. Therefore, you only need to send the index of the words through the GPU data transfer bus, reducing data transfer overhead.
Stars: ✭ 85 (-14.14%)
Opt einsum⚡️Optimizing einsum functions in NumPy, Tensorflow, Dask, and more with contraction order optimization.
Stars: ✭ 397 (+301.01%)
ChainerA flexible framework of neural networks for deep learning
Stars: ✭ 5,656 (+5613.13%)
SpeedtorchLibrary for faster pinned CPU <-> GPU transfer in Pytorch
Stars: ✭ 615 (+521.21%)
MprReference implementation for "Massively Parallel Rendering of Complex Closed-Form Implicit Surfaces" (SIGGRAPH 2020)
Stars: ✭ 84 (-15.15%)
LuxcoreLuxCore source repository
Stars: ✭ 601 (+507.07%)
MarianFast Neural Machine Translation in C++
Stars: ✭ 777 (+684.85%)
PyopenclOpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+697.98%)
Neuralnetwork.netA TensorFlow-inspired neural network library built from scratch in C# 7.3 for .NET Standard 2.0, with GPU support through cuDNN
Stars: ✭ 392 (+295.96%)
ThundergbmThunderGBM: Fast GBDTs and Random Forests on GPUs
Stars: ✭ 586 (+491.92%)
AccelerateEmbedded language for high-performance array computations
Stars: ✭ 751 (+658.59%)
TvmOpen deep learning compiler stack for cpu, gpu and specialized accelerators
Stars: ✭ 7,494 (+7469.7%)
Scikit CudaPython interface to GPU-powered libraries
Stars: ✭ 803 (+711.11%)
DrlkitA High Level Python Deep Reinforcement Learning library. Great for beginners, prototyping and quickly comparing algorithms
Stars: ✭ 29 (-70.71%)
CubCooperative primitives for CUDA C++.
Stars: ✭ 883 (+791.92%)
ThundersvmThunderSVM: A Fast SVM Library on GPUs and CPUs
Stars: ✭ 1,282 (+1194.95%)
CudaExperiments with CUDA and Rust
Stars: ✭ 31 (-68.69%)
Cuda Design PatternsSome CUDA design patterns and a bit of template magic for CUDA
Stars: ✭ 78 (-21.21%)
Hyperlearn50% faster, 50% less RAM Machine Learning. Numba rewritten Sklearn. SVD, NNMF, PCA, LinearReg, RidgeReg, Randomized, Truncated SVD/PCA, CSR Matrices all 50+% faster
Stars: ✭ 1,204 (+1116.16%)
Qualia2.0Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Stars: ✭ 41 (-58.59%)
Carlsim3CARLsim is an efficient, easy-to-use, GPU-accelerated software framework for simulating large-scale spiking neural network (SNN) models with a high degree of biological detail.
Stars: ✭ 52 (-47.47%)
GraphviteGraphVite: A General and High-performance Graph Embedding System
Stars: ✭ 865 (+773.74%)
SixtyfourHow fast can we brute force a 64-bit comparison?
Stars: ✭ 41 (-58.59%)
ArboretumGradient Boosting powered by GPU(NVIDIA CUDA)
Stars: ✭ 64 (-35.35%)
Cudart.jlJulia wrapper for CUDA runtime API
Stars: ✭ 75 (-24.24%)
NxMulti-dimensional arrays (tensors) and numerical definitions for Elixir
Stars: ✭ 1,133 (+1044.44%)