GinkgoNumerical linear algebra software package
Stars: ✭ 149 (-11.31%)
Marian DevFast Neural Machine Translation in C++ - development repository
Stars: ✭ 136 (-19.05%)
MtensorA C++ Cuda Tensor Lazy Computing Library
Stars: ✭ 115 (-31.55%)
DsmnetDomain-invariant Stereo Matching Networks
Stars: ✭ 153 (-8.93%)
AgencyExecution primitives for C++
Stars: ✭ 127 (-24.4%)
PrimitivA Neural Network Toolkit.
Stars: ✭ 164 (-2.38%)
BabelstreamSTREAM, for lots of devices written in many programming models
Stars: ✭ 121 (-27.98%)
Ctranslate2Fast inference engine for OpenNMT models
Stars: ✭ 140 (-16.67%)
Pytorch Unflow a reimplementation of UnFlow in PyTorch that matches the official TensorFlow version
Stars: ✭ 113 (-32.74%)
RmmRAPIDS Memory Manager
Stars: ✭ 154 (-8.33%)
Partial Order PruningPartial Order Pruning: for Best Speed/Accuracy Trade-off in Neural Architecture Search
Stars: ✭ 135 (-19.64%)
LibcudacxxThe C++ Standard Library for your entire system.
Stars: ✭ 1,861 (+1007.74%)
JetsonHelmut Hoffer von Ankershoffen experimenting with arm64 based NVIDIA Jetson (Nano and AGX Xavier) edge devices running Kubernetes (K8s) for machine learning (ML) including Jupyter Notebooks, TensorFlow Training and TensorFlow Serving using CUDA for smart IoT.
Stars: ✭ 151 (-10.12%)
QudaQUDA is a library for performing calculations in lattice QCD on GPUs.
Stars: ✭ 166 (-1.19%)
Warp RnntCUDA-Warp RNN-Transducer
Stars: ✭ 122 (-27.38%)
SketchgraphsA dataset of 15 million CAD sketches with geometric constraint graphs.
Stars: ✭ 148 (-11.9%)
Tensorflow Optimized WheelsTensorFlow wheels built for latest CUDA/CuDNN and enabled performance flags: SSE, AVX, FMA; XLA
Stars: ✭ 118 (-29.76%)
KhivaAn open-source library of algorithms to analyse time series in GPU and CPU.
Stars: ✭ 161 (-4.17%)
Pytorch spnExtension package for spatial propagation network in pytorch.
Stars: ✭ 114 (-32.14%)
RemoterySingle C file, Realtime CPU/GPU Profiler with Remote Web Viewer
Stars: ✭ 1,908 (+1035.71%)
ForwardA library for high performance deep learning inference on NVIDIA GPUs.
Stars: ✭ 136 (-19.05%)
Adacof PytorchOfficial source code for our paper "AdaCoF: Adaptive Collaboration of Flows for Video Frame Interpolation" (CVPR 2020)
Stars: ✭ 110 (-34.52%)
NsimdAgenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (-17.86%)
JcudaJCuda - Java bindings for CUDA
Stars: ✭ 165 (-1.79%)
SpanetSpatial Attentive Single-Image Deraining with a High Quality Real Rain Dataset (CVPR'19)
Stars: ✭ 136 (-19.05%)
Cumf alsCUDA Matrix Factorization Library with Alternating Least Square (ALS)
Stars: ✭ 154 (-8.33%)
FloorA C++ Compute/Graphics Library and Toolchain enabling same-source CUDA/Host/Metal/OpenCL/Vulkan C++ programming and execution.
Stars: ✭ 166 (-1.19%)
NnvmNo description or website provided.
Stars: ✭ 1,639 (+875.6%)
CompactcnncascadeA binary library for very fast face detection using compact CNNs.
Stars: ✭ 152 (-9.52%)
MixbenchA GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL)
Stars: ✭ 130 (-22.62%)
Multi Gpu Programming ModelsExamples demonstrating available options to program multiple GPUs in a single node or a cluster
Stars: ✭ 165 (-1.79%)
Kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+6537.5%)
DragonDragon: A Computation Graph Virtual Machine Based Deep Learning Framework.
Stars: ✭ 168 (+0%)
FcisFully Convolutional Instance-aware Semantic Segmentation
Stars: ✭ 1,563 (+830.36%)
Cuda CnnCNN accelerated by cuda. Test on mnist and finilly get 99.76%
Stars: ✭ 148 (-11.9%)
OnemkloneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (-27.38%)
Cx db8a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (Bert, Universal Sentence Encoder, Flair)
Stars: ✭ 164 (-2.38%)
Knn cudaFast K-Nearest Neighbor search with GPU
Stars: ✭ 119 (-29.17%)
SpocStream Processing with OCaml
Stars: ✭ 115 (-31.55%)
SporcoSparse Optimisation Research Code
Stars: ✭ 164 (-2.38%)
CltuneCLTune: An automatic OpenCL & CUDA kernel tuner
Stars: ✭ 114 (-32.14%)
GpurirPython library for Room Impulse Response (RIR) simulation with GPU acceleration
Stars: ✭ 145 (-13.69%)
Tensorflow Object Detection TutorialThe purpose of this tutorial is to learn how to install and prepare TensorFlow framework to train your own convolutional neural network object detection classifier for multiple objects, starting from scratch
Stars: ✭ 113 (-32.74%)
ClojurecudaClojure library for CUDA development
Stars: ✭ 158 (-5.95%)
Hoomd BlueMolecular dynamics and Monte Carlo soft matter simulation on GPUs.
Stars: ✭ 143 (-14.88%)
Cuda programmingCode from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch
Stars: ✭ 169 (+0.6%)
Deformable KernelsDeforming kernels to adapt towards object deformation. In ICLR 2020.
Stars: ✭ 166 (-1.19%)
Xmrminer🐜 A CUDA based miner for Monero
Stars: ✭ 158 (-5.95%)
Libgdf[ARCHIVED] C GPU DataFrame Library
Stars: ✭ 142 (-15.48%)