Cupy: NumPy & SciPy for GPU
Stars: ✭ 5,625 (+15525%)
Bayadera: High-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (+850%)
Cuda Api Wrappers: Thin C++-flavored wrappers for the CUDA Runtime API
Stars: ✭ 362 (+905.56%)
Hipsycl: Implementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+947.22%)
Chainer: A flexible framework of neural networks for deep learning
Stars: ✭ 5,656 (+15611.11%)
Neanderthal: Fast Clojure Matrix Library
Stars: ✭ 927 (+2475%)
cuda memtest: Fork of CUDA GPU memtest 👓
Stars: ✭ 68 (+88.89%)
Mixbench: A GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL)
Stars: ✭ 130 (+261.11%)
Stdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+1375%)
Heteroflow: Concurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (+58.33%)
gpu-monitor: Script to remotely check GPU servers for free GPUs
Stars: ✭ 85 (+136.11%)
Arraymancer: A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+2102.78%)
Deepnet: Deep.Net machine learning framework for F#
Stars: ✭ 99 (+175%)
Pycuda: CUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+2988.89%)
Tensorflow Object Detection Tutorial: A tutorial on installing and preparing the TensorFlow framework to train your own convolutional neural network object detection classifier for multiple objects, starting from scratch
Stars: ✭ 113 (+213.89%)
HeCBench: software.intel.com/content/www/us/en/develop/articles/repo-evaluating-performance-productivity-oneapi.html
Stars: ✭ 85 (+136.11%)
MatX: An efficient C++17 GPU numerical computing library with Python-like syntax
Stars: ✭ 418 (+1061.11%)
LuisaRender: High-Performance Multiple-Backend Renderer Based on LuisaCompute
Stars: ✭ 47 (+30.56%)
Popsift: PopSift is an implementation of the SIFT algorithm in CUDA.
Stars: ✭ 259 (+619.44%)
Vuh: Vulkan compute for people
Stars: ✭ 264 (+633.33%)
Deep Diamond: A fast Clojure Tensor & Deep Learning library
Stars: ✭ 288 (+700%)
Gprmax: gprMax is open source software that simulates electromagnetic wave propagation using the Finite-Difference Time-Domain (FDTD) method for numerical modelling of Ground Penetrating Radar (GPR)
Stars: ✭ 268 (+644.44%)
Komputation: Komputation is a neural network framework for the Java Virtual Machine written in Kotlin and CUDA C.
Stars: ✭ 295 (+719.44%)
Simple Sh Datascience: A collection of Bash scripts and Dockerfiles to install data science tools, libraries and applications
Stars: ✭ 32 (-11.11%)
Arrayfire: A general purpose GPU library.
Stars: ✭ 3,693 (+10158.33%)
Tutorials: Some basic programming tutorials
Stars: ✭ 353 (+880.56%)
Arrayfire Python: Python bindings for ArrayFire: A general purpose GPU library.
Stars: ✭ 358 (+894.44%)
opencv-cuda-docker: Dockerfiles for OpenCV compiled with CUDA, opencv_contrib modules and Python 3 bindings
Stars: ✭ 55 (+52.78%)
hipacc: A domain-specific language and compiler for image processing
Stars: ✭ 72 (+100%)
tiny-cuda-nn: Lightning fast & tiny C++/CUDA neural network framework
Stars: ✭ 908 (+2422.22%)
Awesome Cuda: This is a list of useful libraries and resources for CUDA development.
Stars: ✭ 274 (+661.11%)
Hemi: Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.
Stars: ✭ 275 (+663.89%)
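QPT: [In closed beta] A forward-style tool for quickly packaging Python environments; it bundles Python programs into an EXE and adds support for CUDA, NoAVX and more.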
Stars: ✭ 308 (+755.56%)
Thrust: The C++ parallel algorithms library.
Stars: ✭ 3,595 (+9886.11%)
Ilgpu: ILGPU JIT Compiler for high-performance .Net GPU programs
Stars: ✭ 374 (+938.89%)
Fast gicp: A collection of GICP-based fast point cloud registration algorithms
Stars: ✭ 307 (+752.78%)
Webclgl: GPGPU Javascript library 🐸
Stars: ✭ 313 (+769.44%)
Mini Caffe: Minimal runtime core of Caffe: forward only, with GPU support and memory efficiency.
Stars: ✭ 373 (+936.11%)
Cudf: cuDF - GPU DataFrame Library
Stars: ✭ 4,370 (+12038.89%)
Cuda: Experiments with CUDA and Rust
Stars: ✭ 31 (-13.89%)
Cuda.jl: CUDA programming in Julia.
Stars: ✭ 370 (+927.78%)
H2o4gpu: H2Oai GPU Edition
Stars: ✭ 416 (+1055.56%)
lbvh: An implementation of parallel linear BVH (LBVH) on GPU
Stars: ✭ 67 (+86.11%)
Bitcracker: BitCracker is the first open source password cracking tool for memory units encrypted with BitLocker
Stars: ✭ 463 (+1186.11%)
Rustacuda: Rusty wrapper for the CUDA Driver API
Stars: ✭ 511 (+1319.44%)
Caer: High-performance Vision library in Python. Scale your research, not boilerplate.
Stars: ✭ 452 (+1155.56%)
Cub: Cooperative primitives for CUDA C++.
Stars: ✭ 883 (+2352.78%)
Cudasift: A CUDA implementation of SIFT for NVidia GPUs (1.2 ms on a GTX 1060)
Stars: ✭ 555 (+1441.67%)
Thundergbm: ThunderGBM: Fast GBDTs and Random Forests on GPUs
Stars: ✭ 586 (+1527.78%)
Picongpu: Particle-in-Cell Simulations for the Exascale Era ✨
Stars: ✭ 452 (+1155.56%)
Lighthouse2: Lighthouse 2 framework for real-time ray tracing
Stars: ✭ 542 (+1405.56%)
Luxcore: LuxCore source repository
Stars: ✭ 601 (+1569.44%)
Speedtorch: Library for faster pinned CPU <-> GPU transfer in Pytorch
Stars: ✭ 615 (+1608.33%)
Accelerate: Embedded language for high-performance array computations
Stars: ✭ 751 (+1986.11%)
Kubernetes Gpu Guide: This guide should help fellow researchers and hobbyists to easily automate and accelerate their deep learning training with their own Kubernetes GPU cluster.
Stars: ✭ 740 (+1955.56%)
Gunrock: High-Performance Graph Primitives on GPUs
Stars: ✭ 718 (+1894.44%)
Marian: Fast Neural Machine Translation in C++
Stars: ✭ 777 (+2058.33%)