runtimeAnyDSL Runtime Library
Stars: ✭ 17 (-73.85%)
TvmOpen deep learning compiler stack for cpu, gpu and specialized accelerators
Stars: ✭ 7,494 (+11429.23%)
Go CyberYour 🔵 Superintelligence
Stars: ✭ 270 (+315.38%)
bandicoot-codeBandicoot: GPU accelerator add-on for the Armadillo C++ linear algebra library
Stars: ✭ 21 (-67.69%)
TwostreamfusionCode release for "Convolutional Two-Stream Network Fusion for Video Action Recognition", CVPR 2016.
Stars: ✭ 618 (+850.77%)
GprmaxgprMax is open source software that simulates electromagnetic wave propagation using the Finite-Difference Time-Domain (FDTD) method for numerical modelling of Ground Penetrating Radar (GPR)
Stars: ✭ 268 (+312.31%)
euler2d cudaFortran2nd order Godunov solver for 2d Euler equations written in CUDA Fortran and stdpar (standard paralelism)
Stars: ✭ 24 (-63.08%)
Style Feature Reshufflecaffe implementation of "Arbitrary Style Transfer with Deep Feature Reshuffle"
Stars: ✭ 38 (-41.54%)
Simple Sh DatascienceA collection of Bash scripts and Dockerfiles to install data science Tool, Lib and application
Stars: ✭ 32 (-50.77%)
SpeedtorchLibrary for faster pinned CPU <-> GPU transfer in Pytorch
Stars: ✭ 615 (+846.15%)
Dynamicfusion Implementation of Newcombe et al. CVPR 2015 DynamicFusion paper
Stars: ✭ 267 (+310.77%)
SwiftOpenCLA swift wrapper around OpenCL. Modelled off the cpp wrapper
Stars: ✭ 17 (-73.85%)
SleefSIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Stars: ✭ 353 (+443.08%)
Cudadrv.jlA Julia wrapper for the CUDA driver API.
Stars: ✭ 64 (-1.54%)
VuhVulkan compute for people
Stars: ✭ 264 (+306.15%)
hpcLearning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )
Stars: ✭ 39 (-40%)
Vulkan KomputeGeneral purpose GPU compute framework for cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing usecases.
Stars: ✭ 350 (+438.46%)
fluctusAn interactive OpenCL wavefront path tracer
Stars: ✭ 55 (-15.38%)
PipecnnAn OpenCL-based FPGA Accelerator for Convolutional Neural Networks
Stars: ✭ 775 (+1092.31%)
NimtorchPyTorch - Python + Nim
Stars: ✭ 346 (+432.31%)
Vc4clOpenCL implementation running on the VideoCore IV GPU of the Raspberry Pi models
Stars: ✭ 611 (+840%)
Kinectfusionlib Implementation of the KinectFusion approach in modern C++14 and CUDA
Stars: ✭ 261 (+301.54%)
coriander-dnnPartial implementation of NVIDIA® cuDNN API for Coriander, OpenCL 1.2
Stars: ✭ 22 (-66.15%)
Carlsim3CARLsim is an efficient, easy-to-use, GPU-accelerated software framework for simulating large-scale spiking neural network (SNN) models with a high degree of biological detail.
Stars: ✭ 52 (-20%)
CudaExperiments with CUDA and Rust
Stars: ✭ 31 (-52.31%)
BrainsimulatorBrain Simulator is a platform for visual prototyping of artificial intelligence architectures.
Stars: ✭ 262 (+303.08%)
PopsiftPopSift is an implementation of the SIFT algorithm in CUDA.
Stars: ✭ 259 (+298.46%)
MarianFast Neural Machine Translation in C++
Stars: ✭ 777 (+1095.38%)
notebooksA docker-based starter kit for machine learning via jupyter notebooks. Designed for those who just want a runtime environment and get on with machine learning. Docker tags:
Stars: ✭ 29 (-55.38%)
CudppCUDA Data Parallel Primitives Library
Stars: ✭ 333 (+412.31%)
Hzproctorch data augmentation toolbox (supports affine transform)
Stars: ✭ 56 (-13.85%)
fahbenchFolding@home GPU benchmark
Stars: ✭ 32 (-50.77%)
Ethereum nvidia miner💰 USB flash drive ISO image for Ethereum, Zcash and Monero mining with NVIDIA graphics cards and Ubuntu GNU/Linux (headless)
Stars: ✭ 772 (+1087.69%)
PysphA framework for Smoothed Particle Hydrodynamics in Python
Stars: ✭ 223 (+243.08%)
Xmrig AmdMonero AMD (OpenCL) miner
Stars: ✭ 322 (+395.38%)
instant-ngpInstant neural graphics primitives: lightning fast NeRF and more
Stars: ✭ 1,863 (+2766.15%)
JitifyA single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).
Stars: ✭ 314 (+383.08%)
Compute RuntimeIntel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver
Stars: ✭ 593 (+812.31%)
gpu-monitorScript to remotely check GPU servers for free GPUs
Stars: ✭ 85 (+30.77%)
LuisaRenderHigh-Performance Multiple-Backend Renderer Based on LuisaCompute
Stars: ✭ 47 (-27.69%)
PrimestereomatchA heterogeneous and fully parallel stereo matching algorithm for depth estimation, implementing a local adaptive support weight (ADSW) Guided Image Filter (GIF) cost aggregation stage. Developed in both C++ and OpenCL.
Stars: ✭ 191 (+193.85%)
ThrustThe C++ parallel algorithms library.
Stars: ✭ 3,595 (+5430.77%)
Cuda word splitThis project is an old code for Chinese words split. It is written by CUDA at 2010, so it could not run well directly under you platform without an GPU card.
Stars: ✭ 31 (-52.31%)
ThundergbmThunderGBM: Fast GBDTs and Random Forests on GPUs
Stars: ✭ 586 (+801.54%)
Knn CudaFast k nearest neighbor search using GPU
Stars: ✭ 310 (+376.92%)
Kubernetes Gpu GuideThis guide should help fellow researchers and hobbyists to easily automate and accelerate there deep leaning training with their own Kubernetes GPU cluster.
Stars: ✭ 740 (+1038.46%)
TaskflowA General-purpose Parallel and Heterogeneous Task Programming System
Stars: ✭ 6,128 (+9327.69%)
Torch-TensorRTPyTorch/TorchScript compiler for NVIDIA GPUs using TensorRT
Stars: ✭ 1,216 (+1770.77%)
PyTorchTOPGPU PyTorch TOP in TouchDesigner with CUDA-enabled OpenCV
Stars: ✭ 58 (-10.77%)
Raspberrypi tempmonRaspberry pi CPU temperature monitor with many functions such as logging, GPIO output, graphing, email, alarm, notifications and stress testing. Python 3.
Stars: ✭ 52 (-20%)
TrtorchPyTorch/TorchScript compiler for NVIDIA GPUs using TensorRT
Stars: ✭ 583 (+796.92%)
MOTMulti-threaded Optimization Toolbox
Stars: ✭ 28 (-56.92%)