Scikit Cuda
Python interface to GPU-powered libraries
Stars: ✭ 803 (+1538.78%)
Des Cuda
DES cracking using brute force algorithm and CUDA
Stars: ✭ 21 (-57.14%)
Thor
Atmospheric fluid dynamics solver optimized for GPUs.
Stars: ✭ 23 (-53.06%)
Juice
The Hacker's Machine Learning Engine
Stars: ✭ 743 (+1416.33%)
Cuda
Experiments with CUDA and Rust
Stars: ✭ 31 (-36.73%)
Ddsh Tip2018
Source code for paper "Deep Discrete Supervised Hashing"
Stars: ✭ 16 (-67.35%)
Soul Engine
Physically based renderer and simulation engine for real-time applications.
Stars: ✭ 37 (-24.49%)
Numba
NumPy aware dynamic Python compiler using LLVM
Stars: ✭ 7,090 (+14369.39%)
Uammd
A CUDA project for Molecular Dynamics, Brownian Dynamics, Hydrodynamics... intended to simulate a very generic system constructing a simulation with modules.
Stars: ✭ 11 (-77.55%)
Zluda
CUDA on Intel GPUs
Stars: ✭ 937 (+1812.24%)
Cuda Convnet2
Automatically exported from code.google.com/p/cuda-convnet2
Stars: ✭ 690 (+1308.16%)
Sepconv Slomo
An implementation of Video Frame Interpolation via Adaptive Separable Convolution using PyTorch
Stars: ✭ 918 (+1773.47%)
Nbody
N body gravity attraction problem solver
Stars: ✭ 40 (-18.37%)
Wheels
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+1718.37%)
Libcudarange
An interval arithmetic and affine arithmetic library for NVIDIA CUDA
Stars: ✭ 5 (-89.8%)
Arraymancer
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
Stars: ✭ 793 (+1518.37%)
Ethereum nvidia miner
💰 USB flash drive ISO image for Ethereum, Zcash and Monero mining with NVIDIA graphics cards and Ubuntu GNU/Linux (headless)
Stars: ✭ 772 (+1475.51%)
Nvidia libs test
Tests and benchmarks for cudnn (and in the future, other nvidia libraries)
Stars: ✭ 36 (-26.53%)
Deep Painterly Harmonization
Code and data for paper "Deep Painterly Harmonization": https://arxiv.org/abs/1804.03189
Stars: ✭ 6,027 (+12200%)
Gpu badmm mt
Bregman ADMM for mass transportation on GPU
Stars: ✭ 10 (-79.59%)
Cupoisson
CUDA implementation of the 2D fast Poisson solver
Stars: ✭ 7 (-85.71%)
Warp Ctc
Pytorch Bindings for warp-ctc
Stars: ✭ 684 (+1295.92%)
Ktt
Kernel Tuning Toolkit
Stars: ✭ 33 (-32.65%)
Neanderthal
Fast Clojure Matrix Library
Stars: ✭ 927 (+1791.84%)
Octree Slam
Large octree map construction and rendering with CUDA and OpenGL
Stars: ✭ 40 (-18.37%)
Lattice net
Fast Point Cloud Segmentation Using Permutohedral Lattices
Stars: ✭ 23 (-53.06%)
Simple Sh Datascience
A collection of Bash scripts and Dockerfiles to install data science tools, libraries and applications
Stars: ✭ 32 (-34.69%)
Cudajacobi
CUDA implementation of the Jacobi method
Stars: ✭ 19 (-61.22%)
Docs Pytorch
Deep Object Co-Segmentation
Stars: ✭ 43 (-12.24%)
Neuralsuperresolution
Real-time video quality improvement for applications such as video-chat using Perceptual Losses
Stars: ✭ 18 (-63.27%)
Cuda word split
This project is old code for Chinese word splitting. It was written in CUDA in 2010, so it may not run well directly on your platform without a GPU card.
Stars: ✭ 31 (-36.73%)
Gmatrix
R package for unleashing the power of NVIDIA GPUs
Stars: ✭ 16 (-67.35%)
Style Feature Reshuffle
Caffe implementation of "Arbitrary Style Transfer with Deep Feature Reshuffle"
Stars: ✭ 38 (-22.45%)
Cudadbclustering
Clustering via Graphics Processor, using the NVIDIA CUDA SDK to perform database clustering on the massively parallel graphics card processor
Stars: ✭ 6 (-87.76%)
Cuda Cnn
Implementation of a simple CNN using CUDA
Stars: ✭ 29 (-40.82%)
Pytorch Loss
label-smooth, amsoftmax, focal-loss, triplet-loss, lovasz-softmax. Maybe useful
Stars: ✭ 812 (+1557.14%)
Slic cuda
Superpixel SLIC for GPU (CUDA)
Stars: ✭ 45 (-8.16%)
Blocksparse
Efficient GPU kernels for block-sparse matrix multiplication and convolution
Stars: ✭ 797 (+1526.53%)
Cub
Cooperative primitives for CUDA C++.
Stars: ✭ 883 (+1702.04%)
Pyopencl
OpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+1512.24%)
Smallpt Parallel Bvh Gpu
A GPU implementation of smallpt (http://www.kevinbeason.com/smallpt/) with Bounding Volume Hierarchy (BVH) tree.
Stars: ✭ 36 (-26.53%)
Marian
Fast Neural Machine Translation in C++
Stars: ✭ 777 (+1485.71%)
Graphvite
GraphVite: A General and High-performance Graph Embedding System
Stars: ✭ 865 (+1665.31%)
Accelerate
Embedded language for high-performance array computations
Stars: ✭ 751 (+1432.65%)
Qualia2.0
Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Stars: ✭ 41 (-16.33%)
Kintinuous
Real-time large scale dense visual SLAM system
Stars: ✭ 740 (+1410.2%)
Theano Roi Align
An implementation of the RoiAlign operation for Theano
Stars: ✭ 11 (-77.55%)
Gunrock
High-Performance Graph Primitives on GPUs
Stars: ✭ 718 (+1365.31%)
Cure
Stars: ✭ 36 (-26.53%)
Stn3d
3D Spatial Transformer Network
Stars: ✭ 8 (-83.67%)
Lyra
Stars: ✭ 43 (-12.24%)
Sixtyfour
How fast can we brute force a 64-bit comparison?
Stars: ✭ 41 (-16.33%)
Object Detection And Location Realsensed435
Use the Intel D435 RealSense camera to perform target detection with the YOLOv3 framework under the OpenCV DNN framework, and compute the 3D position of the object from the depth information. Real-time display of the coordinates in the camera coordinate system. ADD: using YOLOv5 with a TensorRT model, AGX Xavier, real-time object detection.
Stars: ✭ 36 (-26.53%)
Presentations
Slides and demo code for past presentations
Stars: ✭ 7 (-85.71%)