KmcudaLarge scale K-means and K-nn implementation on NVIDIA GPU / CUDA
Stars: ✭ 627 (+397.62%)
nodeGPU-accelerated data science and visualization in node
Stars: ✭ 85 (-32.54%)
TwostreamfusionCode release for "Convolutional Two-Stream Network Fusion for Video Action Recognition", CVPR 2016.
Stars: ✭ 618 (+390.48%)
lane detectionLane detection for the Nvidia Jetson TX2 using OpenCV4Tegra
Stars: ✭ 15 (-88.1%)
HeteroflowConcurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-54.76%)
libelas-gpuImplementation of LIBELAS in cuda.
Stars: ✭ 41 (-67.46%)
LuxcoreLuxCore source repository
Stars: ✭ 601 (+376.98%)
AresdbA GPU-powered real-time analytics storage and query engine.
Stars: ✭ 2,814 (+2133.33%)
Warp RnntCUDA-Warp RNN-Transducer
Stars: ✭ 122 (-3.17%)
DlamiA Deep Learning Amazon Web Service (AWS) AMI that is open, free and works. Run in less than 5 minutes. TensorFlow, Keras, PyTorch, Theano, MXNet, CNTK, Caffe and all dependencies.
Stars: ✭ 239 (+89.68%)
TaskflowA General-purpose Parallel and Heterogeneous Task Programming System
Stars: ✭ 6,128 (+4763.49%)
Nvbio GplNVBIO is a library of reusable components designed to accelerate bioinformatics applications using CUDA.
Stars: ✭ 56 (-55.56%)
OccaJIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal
Stars: ✭ 230 (+82.54%)
Xmrig NvidiaMonero (XMR) NVIDIA miner
Stars: ✭ 560 (+344.44%)
Optix PathtracerSimple physically based path tracer based on Nvidia's Optix Ray Tracing Engine
Stars: ✭ 231 (+83.33%)
ThundersvmThunderSVM: A Fast SVM Library on GPUs and CPUs
Stars: ✭ 1,282 (+917.46%)
TengineTengine is a lite, high performance, modular inference engine for embedded device
Stars: ✭ 4,012 (+3084.13%)
CudamatPython module for performing basic dense linear algebra computations on the GPU using CUDA.
Stars: ✭ 554 (+339.68%)
CupochRobotics with GPU computing
Stars: ✭ 225 (+78.57%)
Dink点云深度学习框架 | Point cloud Deep learning Framework
Stars: ✭ 56 (-55.56%)
DeepspeechDeepSpeech neon implementation
Stars: ✭ 223 (+76.98%)
Lighthouse2Lighthouse 2 framework for real-time ray tracing
Stars: ✭ 542 (+330.16%)
Softmax Splattingan implementation of softmax splatting for differentiable forward warping using PyTorch
Stars: ✭ 218 (+73.02%)
DaceDaCe - Data Centric Parallel Programming
Stars: ✭ 106 (-15.87%)
NicehashquickminerSuper simple & easy Windows 10 cryptocurrency miner made by NiceHash.
Stars: ✭ 211 (+67.46%)
Stdgpustdgpu: Efficient STL-like Data Structures on the GPU
Stars: ✭ 531 (+321.43%)
GenomeworksSDK for GPU accelerated genome assembly and analysis
Stars: ✭ 215 (+70.63%)
BohriumAutomatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Stars: ✭ 209 (+65.87%)
DepthwiseconvolutionA personal depthwise convolution layer implementation on caffe by liuhao.(only GPU)
Stars: ✭ 512 (+306.35%)
AmgxDistributed multigrid linear solver library on GPU
Stars: ✭ 207 (+64.29%)
Pine🌲 Aimbot powered by real-time object detection with neural networks, GPU accelerated with Nvidia. Optimized for use with CS:GO.
Stars: ✭ 202 (+60.32%)
ConvnetA GPU implementation of Convolutional Neural Nets in C++
Stars: ✭ 506 (+301.59%)
Carlsim3CARLsim is an efficient, easy-to-use, GPU-accelerated software framework for simulating large-scale spiking neural network (SNN) models with a high degree of biological detail.
Stars: ✭ 52 (-58.73%)
TimemoryModular C++ Toolkit for Performance Analysis and Logging. Profiling API and Tools for C, C++, CUDA, Fortran, and Python. The C++ template API is essentially a framework to creating tools: it is designed to provide a unifying interface for recording various performance measurements alongside data logging and interfaces to other tools.
Stars: ✭ 192 (+52.38%)
LightseqLightSeq: A High Performance Inference Library for Sequence Processing and Generation
Stars: ✭ 501 (+297.62%)
Pytorch Spynet a reimplementation of Optical Flow Estimation using a Spatial Pyramid Network in PyTorch
Stars: ✭ 190 (+50.79%)
MtensorA C++ Cuda Tensor Lazy Computing Library
Stars: ✭ 115 (-8.73%)
Nvidia DockerBuild and run Docker containers leveraging NVIDIA GPUs
Stars: ✭ 13,961 (+10980.16%)
Tsdf Fusion PythonPython code to fuse multiple RGB-D images into a TSDF voxel volume.
Stars: ✭ 464 (+268.25%)
CumlcuML - RAPIDS Machine Learning Library
Stars: ✭ 2,504 (+1887.3%)
Cs344Introduction to Parallel Programming class code
Stars: ✭ 1,051 (+734.13%)
Gmonitorgmonitor is a GPU monitor (Nvidia only at the moment)
Stars: ✭ 169 (+34.13%)
CaerHigh-performance Vision library in Python. Scale your research, not boilerplate.
Stars: ✭ 452 (+258.73%)
Knn cudapytorch knn [cuda version]
Stars: ✭ 86 (-31.75%)
Kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+8750%)
FcisFully Convolutional Instance-aware Semantic Segmentation
Stars: ✭ 1,563 (+1140.48%)
Knn cudaFast K-Nearest Neighbor search with GPU
Stars: ✭ 119 (-5.56%)
Adacof PytorchOfficial source code for our paper "AdaCoF: Adaptive Collaboration of Flows for Video Frame Interpolation" (CVPR 2020)
Stars: ✭ 110 (-12.7%)
NumerNumeric Erlang - vector and matrix operations with CUDA. Heavily inspired by Pteracuda - https://github.com/kevsmith/pteracuda
Stars: ✭ 91 (-27.78%)
CudadtwGPU-Suite
Stars: ✭ 63 (-50%)
CudajacobiCUDA implementation of the Jacobi method
Stars: ✭ 19 (-84.92%)
PyTorchTOPGPU PyTorch TOP in TouchDesigner with CUDA-enabled OpenCV
Stars: ✭ 58 (-53.97%)