Parenchyma: An extensible HPC framework for CUDA, OpenCL and native CPU.
Stars: ✭ 71 (-67.43%)
Deepjointfilter: The source code of ECCV16 'Deep Joint Image Filtering'.
Stars: ✭ 68 (-68.81%)
Alenka: GPU database engine
Stars: ✭ 1,150 (+427.52%)
Nnvm: No description or website provided.
Stars: ✭ 1,639 (+651.83%)
Autodock Gpu: AutoDock for GPUs and other accelerators
Stars: ✭ 65 (-70.18%)
Ck Caffe: Collective Knowledge workflow for Caffe to automate installation across diverse platforms and to collaboratively evaluate and optimize Caffe-based workloads across diverse hardware, software and data sets (compilers, libraries, tools, models, inputs)
Stars: ✭ 192 (-11.93%)
Cudadrv.jl: A Julia wrapper for the CUDA driver API.
Stars: ✭ 64 (-70.64%)
Mixbench: A GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL)
Stars: ✭ 130 (-40.37%)
Mpn Cov: @ICCV2017: For exploiting second-order statistics, we propose Matrix Power Normalized Covariance pooling (MPN-COV) ConvNets, different from and outperforming those using global average pooling.
Stars: ✭ 63 (-71.1%)
Ggnn: GGNN: State of the Art Graph-based GPU Nearest Neighbor Search
Stars: ✭ 63 (-71.1%)
Gdax Orderbook Ml: Application of machine learning to the Coinbase (GDAX) orderbook
Stars: ✭ 60 (-72.48%)
Amgx: Distributed multigrid linear solver library on GPU
Stars: ✭ 207 (-5.05%)
Pycuda: CUDA integration for Python, plus shiny features
Stars: ✭ 1,112 (+410.09%)
Kaldi: kaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+5015.14%)
Primitiv: A Neural Network Toolkit.
Stars: ✭ 164 (-24.77%)
Flattened Cnn: Flattened convolutional neural networks (1D convolution modules for Torch nn)
Stars: ✭ 59 (-72.94%)
Fcis: Fully Convolutional Instance-aware Semantic Segmentation
Stars: ✭ 1,563 (+616.97%)
Dokai: Collection of Docker images for ML/DL and video processing projects
Stars: ✭ 58 (-73.39%)
Macos Egpu Cuda Guide: Set up CUDA for machine learning (and gaming) on macOS using an NVIDIA eGPU
Stars: ✭ 187 (-14.22%)
Heteroflow: Concurrent CPU-GPU Programming using Task Models
Stars: ✭ 57 (-73.85%)
Onemkl: oneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (-44.04%)
Nvbio Gpl: NVBIO is a library of reusable components designed to accelerate bioinformatics applications using CUDA.
Stars: ✭ 56 (-74.31%)
Mobulaop: A Simple & Flexible Cross Framework Operators Toolkit
Stars: ✭ 161 (-26.15%)
Pyhpc Benchmarks: A suite of benchmarks to test the sequential CPU and GPU performance of the most popular high-performance libraries for Python.
Stars: ✭ 119 (-45.41%)
Pytorch Emdloss: PyTorch 1.0 implementation of the approximate Earth Mover's Distance
Stars: ✭ 82 (-62.39%)
Genomeworks: SDK for GPU accelerated genome assembly and analysis
Stars: ✭ 215 (-1.38%)
Carlsim3: CARLsim is an efficient, easy-to-use, GPU-accelerated software framework for simulating large-scale spiking neural network (SNN) models with a high degree of biological detail.
Stars: ✭ 52 (-76.15%)
Tensorflow Optimized Wheels: TensorFlow wheels built for the latest CUDA/cuDNN with performance flags enabled: SSE, AVX, FMA; XLA
Stars: ✭ 118 (-45.87%)
Cs344: Introduction to Parallel Programming class code
Stars: ✭ 1,051 (+382.11%)
Clojurecuda: Clojure library for CUDA development
Stars: ✭ 158 (-27.52%)
Mtensor: A C++ CUDA Tensor Lazy Computing Library
Stars: ✭ 115 (-47.25%)
Lyra
Stars: ✭ 43 (-80.28%)
Pytorch spn: Extension package for spatial propagation network in PyTorch.
Stars: ✭ 114 (-47.71%)
Sixtyfour: How fast can we brute force a 64-bit comparison?
Stars: ✭ 41 (-81.19%)
Nbody: N-body gravitational attraction problem solver
Stars: ✭ 40 (-81.65%)
Nsimd: Agenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (-36.7%)
Modulated Deform Conv: deformable convolution 2D 3D DeformableConvolution DeformConv Modulated Pytorch CUDA
Stars: ✭ 81 (-62.84%)
Oneflow: OneFlow is a performance-centered and open-source deep learning framework.
Stars: ✭ 2,868 (+1215.6%)
Nvidia libs test: Tests and benchmarks for cuDNN (and in the future, other NVIDIA libraries)
Stars: ✭ 36 (-83.49%)
Futhark: 💥💻💥 A data-parallel functional programming language
Stars: ✭ 1,641 (+652.75%)
Object Detection And Location Realsensed435: Use the Intel RealSense D435 depth camera to perform target detection with the YOLOv3 framework under the OpenCV DNN framework, and compute the 3D position of the object from the depth information. The coordinates are displayed in real time in the camera coordinate system. Added: real-time object detection using a YOLOv5 TensorRT model on AGX Xavier.
Stars: ✭ 36 (-83.49%)
Cumf als: CUDA Matrix Factorization Library with Alternating Least Square (ALS)
Stars: ✭ 154 (-29.36%)
Nnabla Ext Cuda: A CUDA Extension of Neural Network Libraries
Stars: ✭ 79 (-63.76%)
Quda: QUDA is a library for performing calculations in lattice QCD on GPUs.
Stars: ✭ 166 (-23.85%)
Marian Dev: Fast Neural Machine Translation in C++ - development repository
Stars: ✭ 136 (-37.61%)
2016 super resolution: ICCV2015 Image Super-Resolution Using Deep Convolutional Networks
Stars: ✭ 78 (-64.22%)
Cuda Design Patterns: Some CUDA design patterns and a bit of template magic for CUDA
Stars: ✭ 78 (-64.22%)
Hiop: HPC solver for nonlinear optimization problems
Stars: ✭ 75 (-65.6%)
Cudart.jl: Julia wrapper for CUDA runtime API
Stars: ✭ 75 (-65.6%)
Nvidia Gpu Tensor Core Accelerator Pytorch Opencv: A complete machine vision container that includes Jupyter notebooks with built-in code hinting, Anaconda, CUDA-X, TensorRT inference accelerator for Tensor cores, CuPy (GPU drop-in replacement for NumPy), PyTorch, TF2, Tensorboard, and OpenCV for accelerated workloads on NVIDIA Tensor cores and GPUs.
Stars: ✭ 110 (-49.54%)