CudadbclusteringClustering via Graphics Processor, using NVIDIA CUDA sdk to preform database clustering on the massively parallel graphics card processor
Stars: ✭ 6 (-85.71%)
MarianFast Neural Machine Translation in C++
Stars: ✭ 777 (+1750%)
Theano Roi AlignAn implementation of the RoiAlign operation for Theano
Stars: ✭ 11 (-73.81%)
NeuralsuperresolutionReal-time video quality improvement for applications such as video-chat using Perceptual Losses
Stars: ✭ 18 (-57.14%)
Warp CtcPytorch Bindings for warp-ctc
Stars: ✭ 684 (+1528.57%)
CubCooperative primitives for CUDA C++.
Stars: ✭ 883 (+2002.38%)
BlocksparseEfficient GPU kernels for block-sparse matrix multiplication and convolution
Stars: ✭ 797 (+1797.62%)
KttKernel Tuning Toolkit
Stars: ✭ 33 (-21.43%)
KintinuousReal-time large scale dense visual SLAM system
Stars: ✭ 740 (+1661.9%)
CupoissonCUDA implementation of the 2D fast Poisson solver
Stars: ✭ 7 (-83.33%)
CudajacobiCUDA implementation of the Jacobi method
Stars: ✭ 19 (-54.76%)
SlangMaking it easier to work with shaders
Stars: ✭ 627 (+1392.86%)
Cuda CnnImplementation of a simple CNN using CUDA
Stars: ✭ 29 (-30.95%)
GmatrixR package for unleashing the power of NVIDIA GPU's
Stars: ✭ 16 (-61.9%)
Cure Stars: ✭ 36 (-14.29%)
Pytorch Losslabel-smooth, amsoftmax, focal-loss, triplet-loss, lovasz-softmax. Maybe useful
Stars: ✭ 812 (+1833.33%)
GraphviteGraphVite: A General and High-performance Graph Embedding System
Stars: ✭ 865 (+1959.52%)
PyopenclOpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+1780.95%)
Style Feature Reshufflecaffe implementation of "Arbitrary Style Transfer with Deep Feature Reshuffle"
Stars: ✭ 38 (-9.52%)
AccelerateEmbedded language for high-performance array computations
Stars: ✭ 751 (+1688.1%)
Stn3d3D Spatial Transformer Network
Stars: ✭ 8 (-80.95%)
GunrockHigh-Performance Graph Primitives on GPUs
Stars: ✭ 718 (+1609.52%)
Simple Sh DatascienceA collection of Bash scripts and Dockerfiles to install data science Tool, Lib and application
Stars: ✭ 32 (-23.81%)
ChainerA flexible framework of neural networks for deep learning
Stars: ✭ 5,656 (+13366.67%)
NeanderthalFast Clojure Matrix Library
Stars: ✭ 927 (+2107.14%)
Sepconv Slomoan implementation of Video Frame Interpolation via Adaptive Separable Convolution using PyTorch
Stars: ✭ 918 (+2085.71%)
KmcudaLarge scale K-means and K-nn implementation on NVIDIA GPU / CUDA
Stars: ✭ 627 (+1392.86%)
Nvidia libs testTests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-14.29%)
WheelsPerformance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+2021.43%)
Des CudaDES cracking using brute force algorithm and CUDA
Stars: ✭ 21 (-50%)
Ddsh Tip2018source code for paper "Deep Discrete Supervised Hashing"
Stars: ✭ 16 (-61.9%)
NbodyN body gravity attraction problem solver
Stars: ✭ 40 (-4.76%)
LibcudarangeAn interval arithmetic and affine arithmetic library for NVIDIA CUDA
Stars: ✭ 5 (-88.1%)
Scikit CudaPython interface to GPU-powered libraries
Stars: ✭ 803 (+1811.9%)
Object Detection And Location Realsensed435Use the Intel D435 real-sensing camera to realize target detection based on the Yolov3 framework under the Opencv DNN framework, and realize the 3D positioning of the Objection according to the depth information. Real-time display of the coordinates in the camera coordinate system.ADD--Using Yolov5 By TensorRT model,AGX-Xavier,RealTime Object Detection
Stars: ✭ 36 (-14.29%)
ArraymancerA fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+1788.1%)
UammdA CUDA project for Molecular Dynamics, Brownian Dynamics, Hydrodynamics... intended to simulate a very generic system constructing a simulation with modules.
Stars: ✭ 11 (-73.81%)
NumbaNumPy aware dynamic Python compiler using LLVM
Stars: ✭ 7,090 (+16780.95%)
SixtyfourHow fast can we brute force a 64-bit comparison?
Stars: ✭ 41 (-2.38%)
Ethereum nvidia miner💰 USB flash drive ISO image for Ethereum, Zcash and Monero mining with NVIDIA graphics cards and Ubuntu GNU/Linux (headless)
Stars: ✭ 772 (+1738.1%)
Gpu badmm mtBregman ADMM for mass transportation on GPU
Stars: ✭ 10 (-76.19%)
JuiceThe Hacker's Machine Learning Engine
Stars: ✭ 743 (+1669.05%)
Deep Painterly HarmonizationCode and data for paper "Deep Painterly Harmonization": https://arxiv.org/abs/1804.03189
Stars: ✭ 6,027 (+14250%)
PresentationsSlides and demo code for past presentations
Stars: ✭ 7 (-83.33%)
Cuda Convnet2Automatically exported from code.google.com/p/cuda-convnet2
Stars: ✭ 690 (+1542.86%)
Soul EnginePhysically based renderer and simulation engine for real-time applications.
Stars: ✭ 37 (-11.9%)
Nv WavenetReference implementation of real-time autoregressive wavenet inference
Stars: ✭ 681 (+1521.43%)
ZludaCUDA on Intel GPUs
Stars: ✭ 937 (+2130.95%)
Mc CnnStereo Matching by Training a Convolutional Neural Network to Compare Image Patches
Stars: ✭ 638 (+1419.05%)
CudaExperiments with CUDA and Rust
Stars: ✭ 31 (-26.19%)
ThorAtmospheric fluid dynamics solver optimized for GPUs.
Stars: ✭ 23 (-45.24%)
Qualia2.0Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Stars: ✭ 41 (-2.38%)
Octree SlamLarge octree map construction and rendering with CUDA and OpenGL
Stars: ✭ 40 (-4.76%)
Smallpt Parallel Bvh GpuA GPU implementation of smallpt (http://www.kevinbeason.com/smallpt/) with Bounding Volume Hierarchy (BVH) tree.
Stars: ✭ 36 (-14.29%)
Cuda word splitThis project is an old code for Chinese words split. It is written by CUDA at 2010, so it could not run well directly under you platform without an GPU card.
Stars: ✭ 31 (-26.19%)
Lattice netFast Point Cloud Segmentation Using Permutohedral Lattices
Stars: ✭ 23 (-45.24%)