NnvmNo description or website provided.
Stars: ✭ 1,639 (+869.82%)
FcisFully Convolutional Instance-aware Semantic Segmentation
Stars: ✭ 1,563 (+824.85%)
Cuda CnnCNN accelerated by cuda. Test on mnist and finilly get 99.76%
Stars: ✭ 148 (-12.43%)
SpanetSpatial Attentive Single-Image Deraining with a High Quality Real Rain Dataset (CVPR'19)
Stars: ✭ 136 (-19.53%)
CltuneCLTune: An automatic OpenCL & CUDA kernel tuner
Stars: ✭ 114 (-32.54%)
CompactcnncascadeA binary library for very fast face detection using compact CNNs.
Stars: ✭ 152 (-10.06%)
Cx db8a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (Bert, Universal Sentence Encoder, Flair)
Stars: ✭ 164 (-2.96%)
Knn cudaFast K-Nearest Neighbor search with GPU
Stars: ✭ 119 (-29.59%)
GpurirPython library for Room Impulse Response (RIR) simulation with GPU acceleration
Stars: ✭ 145 (-14.2%)
NsimdAgenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (-18.34%)
Adacof PytorchOfficial source code for our paper "AdaCoF: Adaptive Collaboration of Flows for Video Frame Interpolation" (CVPR 2020)
Stars: ✭ 110 (-34.91%)
Cumf alsCUDA Matrix Factorization Library with Alternating Least Square (ALS)
Stars: ✭ 154 (-8.88%)
Multi Gpu Programming ModelsExamples demonstrating available options to program multiple GPUs in a single node or a cluster
Stars: ✭ 165 (-2.37%)
MixbenchA GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL)
Stars: ✭ 130 (-23.08%)
Kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+6498.22%)
SporcoSparse Optimisation Research Code
Stars: ✭ 164 (-2.96%)
OnemkloneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (-27.81%)
Optical Flow FilterA real time optical flow algorithm implemented on GPU
Stars: ✭ 146 (-13.61%)
SpocStream Processing with OCaml
Stars: ✭ 115 (-31.95%)
ClojurecudaClojure library for CUDA development
Stars: ✭ 158 (-6.51%)
Tensorflow Object Detection TutorialThe purpose of this tutorial is to learn how to install and prepare TensorFlow framework to train your own convolutional neural network object detection classifier for multiple objects, starting from scratch
Stars: ✭ 113 (-33.14%)
Hoomd BlueMolecular dynamics and Monte Carlo soft matter simulation on GPUs.
Stars: ✭ 143 (-15.38%)
Ctranslate2Fast inference engine for OpenNMT models
Stars: ✭ 140 (-17.16%)
Futhark💥💻💥 A data-parallel functional programming language
Stars: ✭ 1,641 (+871.01%)
RmmRAPIDS Memory Manager
Stars: ✭ 154 (-8.88%)
Marian DevFast Neural Machine Translation in C++ - development repository
Stars: ✭ 136 (-19.53%)
Partial Order PruningPartial Order Pruning: for Best Speed/Accuracy Trade-off in Neural Architecture Search
Stars: ✭ 135 (-20.12%)
DsmnetDomain-invariant Stereo Matching Networks
Stars: ✭ 153 (-9.47%)
QudaQUDA is a library for performing calculations in lattice QCD on GPUs.
Stars: ✭ 166 (-1.78%)
LibcudacxxThe C++ Standard Library for your entire system.
Stars: ✭ 1,861 (+1001.18%)
JetsonHelmut Hoffer von Ankershoffen experimenting with arm64 based NVIDIA Jetson (Nano and AGX Xavier) edge devices running Kubernetes (K8s) for machine learning (ML) including Jupyter Notebooks, TensorFlow Training and TensorFlow Serving using CUDA for smart IoT.
Stars: ✭ 151 (-10.65%)
AgencyExecution primitives for C++
Stars: ✭ 127 (-24.85%)
PrimitivA Neural Network Toolkit.
Stars: ✭ 164 (-2.96%)
GinkgoNumerical linear algebra software package
Stars: ✭ 149 (-11.83%)
Deformable KernelsDeforming kernels to adapt towards object deformation. In ICLR 2020.
Stars: ✭ 166 (-1.78%)
Warp RnntCUDA-Warp RNN-Transducer
Stars: ✭ 122 (-27.81%)
SketchgraphsA dataset of 15 million CAD sketches with geometric constraint graphs.
Stars: ✭ 148 (-12.43%)
BabelstreamSTREAM, for lots of devices written in many programming models
Stars: ✭ 121 (-28.4%)
KhivaAn open-source library of algorithms to analyse time series in GPU and CPU.
Stars: ✭ 161 (-4.73%)
Tensorflow Optimized WheelsTensorFlow wheels built for latest CUDA/CuDNN and enabled performance flags: SSE, AVX, FMA; XLA
Stars: ✭ 118 (-30.18%)
MtensorA C++ Cuda Tensor Lazy Computing Library
Stars: ✭ 115 (-31.95%)
Pytorch spnExtension package for spatial propagation network in pytorch.
Stars: ✭ 114 (-32.54%)
RemoterySingle C file, Realtime CPU/GPU Profiler with Remote Web Viewer
Stars: ✭ 1,908 (+1028.99%)
Pytorch Unflow a reimplementation of UnFlow in PyTorch that matches the official TensorFlow version
Stars: ✭ 113 (-33.14%)
Xmrminer🐜 A CUDA based miner for Monero
Stars: ✭ 158 (-6.51%)
Libgdf[ARCHIVED] C GPU DataFrame Library
Stars: ✭ 142 (-15.98%)
DragonDragon: A Computation Graph Virtual Machine Based Deep Learning Framework.
Stars: ✭ 168 (-0.59%)
FloorA C++ Compute/Graphics Library and Toolchain enabling same-source CUDA/Host/Metal/OpenCL/Vulkan C++ programming and execution.
Stars: ✭ 166 (-1.78%)
JcudaJCuda - Java bindings for CUDA
Stars: ✭ 165 (-2.37%)
ForwardA library for high performance deep learning inference on NVIDIA GPUs.
Stars: ✭ 136 (-19.53%)