GOSH: An ultra-fast, GPU-based large graph embedding algorithm utilizing a novel coarsening algorithm requiring not more than a single GPU.
Stars: ✭ 12 (-96.6%)
Neanderthal: Fast Clojure Matrix Library
Stars: ✭ 927 (+162.61%)
HeCBench: software.intel.com/content/www/us/en/develop/articles/repo-evaluating-performance-productivity-oneapi.html
Stars: ✭ 85 (-75.92%)
Autodock Gpu: AutoDock for GPUs and other accelerators
Stars: ✭ 65 (-81.59%)
MatX: An efficient C++17 GPU numerical computing library with Python-like syntax
Stars: ✭ 418 (+18.41%)
Cuda Api Wrappers: Thin C++-flavored wrappers for the CUDA Runtime API
Stars: ✭ 362 (+2.55%)
Pycuda: CUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+215.01%)
Accelerate: Embedded language for high-performance array computations
Stars: ✭ 751 (+112.75%)
Heteroflow: Concurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-83.85%)
Clojurecuda: Clojure library for CUDA development
Stars: ✭ 158 (-55.24%)
rbcuda: CUDA bindings for Ruby
Stars: ✭ 57 (-83.85%)
Arraymancer: A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+124.65%)
Ginkgo: Numerical linear algebra software package
Stars: ✭ 149 (-57.79%)
Sixtyfour: How fast can we brute force a 64-bit comparison?
Stars: ✭ 41 (-88.39%)
Luxcore: LuxCore source repository
Stars: ✭ 601 (+70.25%)
Nvidia libs test: Tests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-89.8%)
Stdgpu: stdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+50.42%)
Hipsycl: Implementation of SYCL for CPUs, AMD GPUs, NVIDIA GPUs
Stars: ✭ 377 (+6.8%)
Deepnet: Deep.Net machine learning framework for F#
Stars: ✭ 99 (-71.95%)
cuda memtest: Fork of CUDA GPU memtest 👓
Stars: ✭ 68 (-80.74%)
Bayadera: High-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (-3.12%)
Fbcuda: Facebook's CUDA extensions.
Stars: ✭ 275 (-22.1%)
Webclgl: GPGPU Javascript library 🐸
Stars: ✭ 313 (-11.33%)
Awesome Cuda: This is a list of useful libraries and resources for CUDA development.
Stars: ✭ 274 (-22.38%)
Knn Cuda: Fast k nearest neighbor search using GPU
Stars: ✭ 310 (-12.18%)
Hemi: Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.
Stars: ✭ 275 (-22.1%)
Cudpp: CUDA Data Parallel Primitives Library
Stars: ✭ 333 (-5.67%)
Unsupervised Videos: Unsupervised Learning of Video Representations using LSTMs
Stars: ✭ 309 (-12.46%)
Go Cyber: Your 🔵 Superintelligence
Stars: ✭ 270 (-23.51%)
Gprmax: gprMax is open source software that simulates electromagnetic wave propagation using the Finite-Difference Time-Domain (FDTD) method for numerical modelling of Ground Penetrating Radar (GPR)
Stars: ✭ 268 (-24.08%)
Dynamicfusion: Implementation of Newcombe et al. CVPR 2015 DynamicFusion paper
Stars: ✭ 267 (-24.36%)
Visionaray: A C++-based, cross platform ray tracing library
Stars: ✭ 342 (-3.12%)
Arrayfire: ArrayFire: a general purpose GPU library.
Stars: ✭ 3,693 (+946.18%)
Person Reid gan: ICCV2017 Unlabeled Samples Generated by GAN Improve the Person Re-identification Baseline in vitro
Stars: ✭ 301 (-14.73%)
Clojurecl: ClojureCL is a Clojure library for parallel computations with OpenCL.
Stars: ✭ 266 (-24.65%)
Vuh: Vulkan compute for people
Stars: ✭ 264 (-25.21%)
Kinectfusionlib: Implementation of the KinectFusion approach in modern C++14 and CUDA
Stars: ✭ 261 (-26.06%)
Brainsimulator: Brain Simulator is a platform for visual prototyping of artificial intelligence architectures.
Stars: ✭ 262 (-25.78%)
3: GPU-accelerated micromagnetic simulator
Stars: ✭ 324 (-8.22%)
Cuda voxelizer: CUDA Voxelizer to convert polygon meshes into annotated voxel grids
Stars: ✭ 299 (-15.3%)
Popsift: PopSift is an implementation of the SIFT algorithm in CUDA.
Stars: ✭ 259 (-26.63%)
instant-ngp: Instant neural graphics primitives: lightning fast NeRF and more
Stars: ✭ 1,863 (+427.76%)
Deep High Resolution Net.pytorch: The project is an official implementation of our CVPR2019 paper "Deep High-Resolution Representation Learning for Human Pose Estimation"
Stars: ✭ 3,521 (+897.45%)
gpu-monitor: Script to remotely check GPU servers for free GPUs
Stars: ✭ 85 (-75.92%)
LuisaRender: High-Performance Multiple-Backend Renderer Based on LuisaCompute
Stars: ✭ 47 (-86.69%)
Sleef: SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Stars: ✭ 353 (+0%)
Nimtorch: PyTorch - Python + Nim
Stars: ✭ 346 (-1.98%)
Cutorch: A CUDA backend for Torch7
Stars: ✭ 322 (-8.78%)
Komputation: Komputation is a neural network framework for the Java Virtual Machine written in Kotlin and CUDA C.
Stars: ✭ 295 (-16.43%)
Ffmpeg Build Script: The FFmpeg build script provides an easy way to build a static FFmpeg on OSX and Linux with non-free codecs included.
Stars: ✭ 290 (-17.85%)
Torch-TensorRT: PyTorch/TorchScript compiler for NVIDIA GPUs using TensorRT
Stars: ✭ 1,216 (+244.48%)
PyTorchTOP: GPU PyTorch TOP in TouchDesigner with CUDA-enabled OpenCV
Stars: ✭ 58 (-83.57%)
Jitify: A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).
Stars: ✭ 314 (-11.05%)
Open quadtree mapping: This is a monocular dense mapping system corresponding to IROS 2018 "Quadtree-accelerated Real-time Monocular Dense Mapping"
Stars: ✭ 292 (-17.28%)