Thor
Atmospheric fluid dynamics solver optimized for GPUs.
Stars: ✭ 23 (-56.6%)
Ddsh Tip2018
Source code for the paper "Deep Discrete Supervised Hashing"
Stars: ✭ 16 (-69.81%)
Cuda
Experiments with CUDA and Rust
Stars: ✭ 31 (-41.51%)
Presentations
Slides and demo code for past presentations
Stars: ✭ 7 (-86.79%)
Numba
NumPy aware dynamic Python compiler using LLVM
Stars: ✭ 7,090 (+13277.36%)
Object Detection And Location Realsensed435
Uses the Intel RealSense D435 depth camera for object detection with YOLOv3 under the OpenCV DNN framework, and computes the 3D position of each detected object from the depth information. Coordinates in the camera coordinate system are displayed in real time. Added: YOLOv5 with a TensorRT model on AGX Xavier for real-time object detection.
Stars: ✭ 36 (-32.08%)
Sixtyfour
How fast can we brute force a 64-bit comparison?
Stars: ✭ 41 (-22.64%)
Scikit Cuda
Python interface to GPU-powered libraries
Stars: ✭ 803 (+1415.09%)
Des Cuda
DES cracking using a brute-force algorithm and CUDA
Stars: ✭ 21 (-60.38%)
Gpu badmm mt
Bregman ADMM for mass transportation on GPU
Stars: ✭ 10 (-81.13%)
Juice
The Hacker's Machine Learning Engine
Stars: ✭ 743 (+1301.89%)
Nvidia libs test
Tests and benchmarks for cuDNN (and, in the future, other NVIDIA libraries)
Stars: ✭ 36 (-32.08%)
Zluda
CUDA on Intel GPUs
Stars: ✭ 937 (+1667.92%)
Sepconv Slomo
An implementation of Video Frame Interpolation via Adaptive Separable Convolution using PyTorch
Stars: ✭ 918 (+1632.08%)
Wheels
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
Stars: ✭ 891 (+1581.13%)
Libcudarange
An interval arithmetic and affine arithmetic library for NVIDIA CUDA
Stars: ✭ 5 (-90.57%)
Arraymancer
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, CUDA and OpenCL backends
Stars: ✭ 793 (+1396.23%)
Nbody
N-body gravitational attraction problem solver
Stars: ✭ 40 (-24.53%)
Ethereum nvidia miner
💰 USB flash drive ISO image for Ethereum, Zcash and Monero mining with NVIDIA graphics cards and Ubuntu GNU/Linux (headless)
Stars: ✭ 772 (+1356.6%)
Theano Roi Align
An implementation of the RoiAlign operation for Theano
Stars: ✭ 11 (-79.25%)
Kintinuous
Real-time large scale dense visual SLAM system
Stars: ✭ 740 (+1296.23%)
Smallpt Parallel Bvh Gpu
A GPU implementation of smallpt (http://www.kevinbeason.com/smallpt/) with a Bounding Volume Hierarchy (BVH) tree.
Stars: ✭ 36 (-32.08%)
Stn3d
3D Spatial Transformer Network
Stars: ✭ 8 (-84.91%)
Docs Pytorch
Deep Object Co-Segmentation
Stars: ✭ 43 (-18.87%)
Cupoisson
CUDA implementation of the 2D fast Poisson solver
Stars: ✭ 7 (-86.79%)
Cure
Stars: ✭ 36 (-32.08%)
Neanderthal
Fast Clojure Matrix Library
Stars: ✭ 927 (+1649.06%)
Hornet
Hornet data structure for sparse dynamic graphs and matrices
Stars: ✭ 49 (-7.55%)
Lattice net
Fast Point Cloud Segmentation Using Permutohedral Lattices
Stars: ✭ 23 (-56.6%)
Ktt
Kernel Tuning Toolkit
Stars: ✭ 33 (-37.74%)
Cudajacobi
CUDA implementation of the Jacobi method
Stars: ✭ 19 (-64.15%)
Qualia2.0
Qualia is a deep learning framework deeply integrated with automatic differentiation and dynamic graphing with CUDA acceleration. Qualia was built from scratch.
Stars: ✭ 41 (-22.64%)
Neuralsuperresolution
Real-time video quality improvement for applications such as video chat, using Perceptual Losses
Stars: ✭ 18 (-66.04%)
Simple Sh Datascience
A collection of Bash scripts and Dockerfiles to install data science tools, libraries, and applications
Stars: ✭ 32 (-39.62%)
Gmatrix
R package for unleashing the power of NVIDIA GPUs
Stars: ✭ 16 (-69.81%)
Hungariangpu
A GPU/CUDA implementation of the Hungarian algorithm
Stars: ✭ 51 (-3.77%)
Cudadbclustering
Clustering via Graphics Processor, using the NVIDIA CUDA SDK to perform database clustering on the massively parallel graphics card processor
Stars: ✭ 6 (-88.68%)
Cuda word split
This project is old code for Chinese word splitting. It was written in CUDA in 2010, so it may not run directly on your platform without a GPU card.
Stars: ✭ 31 (-41.51%)
Pytorch Loss
label-smooth, amsoftmax, focal-loss, triplet-loss, lovasz-softmax. Maybe useful
Stars: ✭ 812 (+1432.08%)
Octree Slam
Large octree map construction and rendering with CUDA and OpenGL
Stars: ✭ 40 (-24.53%)
Blocksparse
Efficient GPU kernels for block-sparse matrix multiplication and convolution
Stars: ✭ 797 (+1403.77%)
Cuda Cnn
Implementation of a simple CNN using CUDA
Stars: ✭ 29 (-45.28%)
Pyopencl
OpenCL integration for Python, plus shiny features
Stars: ✭ 790 (+1390.57%)
Slic cuda
Superpixel SLIC for GPU (CUDA)
Stars: ✭ 45 (-15.09%)
Marian
Fast Neural Machine Translation in C++
Stars: ✭ 777 (+1366.04%)
Cub
Cooperative primitives for CUDA C++.
Stars: ✭ 883 (+1566.04%)
Accelerate
Embedded language for high-performance array computations
Stars: ✭ 751 (+1316.98%)
Style Feature Reshuffle
Caffe implementation of "Arbitrary Style Transfer with Deep Feature Reshuffle"
Stars: ✭ 38 (-28.3%)
Graphvite
GraphVite: A General and High-performance Graph Embedding System
Stars: ✭ 865 (+1532.08%)
Carlsim3
CARLsim is an efficient, easy-to-use, GPU-accelerated software framework for simulating large-scale spiking neural network (SNN) models with a high degree of biological detail.
Stars: ✭ 52 (-1.89%)
Cs344
Introduction to Parallel Programming class code
Stars: ✭ 1,051 (+1883.02%)
Lyra
Stars: ✭ 43 (-18.87%)
Soul Engine
Physically based renderer and simulation engine for real-time applications.
Stars: ✭ 37 (-30.19%)
Uammd
A CUDA project for Molecular Dynamics, Brownian Dynamics, Hydrodynamics... intended to simulate a very generic system by constructing a simulation from modules.
Stars: ✭ 11 (-79.25%)