Torch samplingEfficient reservoir sampling implementation for PyTorch
Stars: ✭ 68 (-68.37%)
Multi Gpu Programming ModelsExamples demonstrating available options to program multiple GPUs in a single node or a cluster
Stars: ✭ 165 (-23.26%)
GanetGA-Net: Guided Aggregation Net for End-to-end Stereo Matching
Stars: ✭ 393 (+82.79%)
AmgclC++ library for solving large sparse linear systems with algebraic multigrid method
Stars: ✭ 390 (+81.4%)
AgencyExecution primitives for C++
Stars: ✭ 127 (-40.93%)
Music TranslationA UNIVERSAL MUSIC TRANSLATION NETWORK - a method for translating music across musical instruments and styles.
Stars: ✭ 385 (+79.07%)
OmatsuriPWA with 12 open source frontend focused tools
Stars: ✭ 1,131 (+426.05%)
IlgpuILGPU JIT Compiler for high-performance .Net GPU programs
Stars: ✭ 374 (+73.95%)
HipHIP: C++ Heterogeneous-Compute Interface for Portability
Stars: ✭ 2,609 (+1113.49%)
NvpipeNVIDIA-accelerated zero latency video compression library for interactive remoting applications
Stars: ✭ 376 (+74.88%)
Cudadrv.jlA Julia wrapper for the CUDA driver API.
Stars: ✭ 64 (-70.23%)
Mini CaffeMinimal runtime core of Caffe, Forward only, GPU support and Memory efficiency.
Stars: ✭ 373 (+73.49%)
DarkposeDistribution-Aware Coordinate Representation for Human Pose Estimation
Stars: ✭ 369 (+71.63%)
CudadtwGPU-Suite
Stars: ✭ 63 (-70.7%)
Cuda Api WrappersThin C++-flavored wrappers for the CUDA Runtime API
Stars: ✭ 362 (+68.37%)
Cx db8a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (Bert, Universal Sentence Encoder, Flair)
Stars: ✭ 164 (-23.72%)
K2FSA/FST algorithms, differentiable, with PyTorch compatibility.
Stars: ✭ 354 (+64.65%)
CutlassCUDA Templates for Linear Algebra Subroutines
Stars: ✭ 1,123 (+422.33%)
SleefSIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Stars: ✭ 353 (+64.19%)
NimtorchPyTorch - Python + Nim
Stars: ✭ 346 (+60.93%)
Tsne CudaGPU Accelerated t-SNE for CUDA with Python bindings
Stars: ✭ 1,120 (+420.93%)
BayaderaHigh-performance Bayesian Data Analysis on the GPU in Clojure
Stars: ✭ 342 (+59.07%)
Fast Ide🕺Fast Integrated Development Environment 😻
Stars: ✭ 181 (-15.81%)
ArrayfireArrayFire: a general purpose GPU library.
Stars: ✭ 3,693 (+1617.67%)
MinkowskiengineMinkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors
Stars: ✭ 1,110 (+416.28%)
CutorchA CUDA backend for Torch7
Stars: ✭ 322 (+49.77%)
Warp RnntCUDA-Warp RNN-Transducer
Stars: ✭ 122 (-43.26%)
Cuda ProgrammingSample codes for my CUDA programming book
Stars: ✭ 313 (+45.58%)
MpmSimulating on GPU using Material Point Method and rendering.
Stars: ✭ 61 (-71.63%)
TideA General Toolbox for Identifying Object Detection Errors
Stars: ✭ 309 (+43.72%)
ClojurecudaClojure library for CUDA development
Stars: ✭ 158 (-26.51%)
Knn CudaFast k nearest neighbor search using GPU
Stars: ✭ 310 (+44.19%)
GraffitiMinimalistic GraphQL framework
Stars: ✭ 306 (+42.33%)
BabelstreamSTREAM, for lots of devices written in many programming models
Stars: ✭ 121 (-43.72%)
Cuda voxelizerCUDA Voxelizer to convert polygon meshes into annotated voxel grids
Stars: ✭ 299 (+39.07%)
Deep High Resolution Net.pytorchThe project is an official implementation of our CVPR2019 paper "Deep High-Resolution Representation Learning for Human Pose Estimation"
Stars: ✭ 3,521 (+1537.67%)
SimplegpuhashtableA simple GPU hash table implemented in CUDA using lock free techniques
Stars: ✭ 198 (-7.91%)
Ffmpeg Build ScriptThe FFmpeg build script provides an easy way to build a static FFmpeg on OSX and Linux with non-free codecs included.
Stars: ✭ 290 (+34.88%)
Cuarrays.jlA Curious Cumulation of CUDA Cuisine
Stars: ✭ 283 (+31.63%)
Cuda SamplesSamples for CUDA Developers which demonstrates features in CUDA Toolkit
Stars: ✭ 1,087 (+405.58%)
Torch Toolbox[Active development]ToolBox to make using Pytorch much easier.Give it a star if you feel helpful.
Stars: ✭ 268 (+24.65%)
Hzproctorch data augmentation toolbox (supports affine transform)
Stars: ✭ 56 (-73.95%)
GenomeworksSDK for GPU accelerated genome assembly and analysis
Stars: ✭ 215 (+0%)
BohriumAutomatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Stars: ✭ 209 (-2.79%)
Cunn Stars: ✭ 205 (-4.65%)
Pytorch Spynet a reimplementation of Optical Flow Estimation using a Spatial Pyramid Network in PyTorch
Stars: ✭ 190 (-11.63%)
Deformable KernelsDeforming kernels to adapt towards object deformation. In ICLR 2020.
Stars: ✭ 166 (-22.79%)
ForwardA library for high performance deep learning inference on NVIDIA GPUs.
Stars: ✭ 136 (-36.74%)
Deeppipe2Deep Learning library using GPU(CUDA/cuBLAS)
Stars: ✭ 90 (-58.14%)
ToolbeltA toolbelt of useful classes and functions to be used with python-requests
Stars: ✭ 748 (+247.91%)