Warp Rnnt
CUDA-Warp RNN-Transducer
Stars: ✭ 122 (-26.06%)
Pytorch spn
Extension package for spatial propagation network in pytorch.
Stars: ✭ 114 (-30.91%)
Ctranslate2
Fast inference engine for OpenNMT models
Stars: ✭ 140 (-15.15%)
Dace
DaCe - Data Centric Parallel Programming
Stars: ✭ 106 (-35.76%)
Remotery
Single C file, Realtime CPU/GPU Profiler with Remote Web Viewer
Stars: ✭ 1,908 (+1056.36%)
Tensorflow Optimized Wheels
TensorFlow wheels built for latest CUDA/CuDNN and enabled performance flags: SSE, AVX, FMA; XLA
Stars: ✭ 118 (-28.48%)
Jetson
Helmut Hoffer von Ankershoffen experimenting with arm64 based NVIDIA Jetson (Nano and AGX Xavier) edge devices running Kubernetes (K8s) for machine learning (ML) including Jupyter Notebooks, TensorFlow Training and TensorFlow Serving using CUDA for smart IoT.
Stars: ✭ 151 (-8.48%)
Futhark
💥💻💥 A data-parallel functional programming language
Stars: ✭ 1,641 (+894.55%)
Partial Order Pruning
Partial Order Pruning: for Best Speed/Accuracy Trade-off in Neural Architecture Search
Stars: ✭ 135 (-18.18%)
Agency
Execution primitives for C++
Stars: ✭ 127 (-23.03%)
Deepnet
Deep.Net machine learning framework for F#
Stars: ✭ 99 (-40%)
Dsmnet
Domain-invariant Stereo Matching Networks
Stars: ✭ 153 (-7.27%)
Babelstream
STREAM, for lots of devices written in many programming models
Stars: ✭ 121 (-26.67%)
Libgdf
[ARCHIVED] C GPU DataFrame Library
Stars: ✭ 142 (-13.94%)
Mtensor
A C++ Cuda Tensor Lazy Computing Library
Stars: ✭ 115 (-30.3%)
Xmrminer
🐜 A CUDA based miner for Monero
Stars: ✭ 158 (-4.24%)
Pytorch Unflow
a reimplementation of UnFlow in PyTorch that matches the official TensorFlow version
Stars: ✭ 113 (-31.52%)
Marian Dev
Fast Neural Machine Translation in C++ - development repository
Stars: ✭ 136 (-17.58%)
Ginkgo
Numerical linear algebra software package
Stars: ✭ 149 (-9.7%)
Cuda Winograd
Fast CUDA Kernels for ResNet Inference.
Stars: ✭ 104 (-36.97%)
Mixbench
A GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL)
Stars: ✭ 130 (-21.21%)
Dpp
Detail-Preserving Pooling in Deep Networks (CVPR 2018)
Stars: ✭ 99 (-40%)
Optical Flow Filter
A real time optical flow algorithm implemented on GPU
Stars: ✭ 146 (-11.52%)
Cumf als
CUDA Matrix Factorization Library with Alternating Least Square (ALS)
Stars: ✭ 154 (-6.67%)
Kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+6658.18%)
Gpurir
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
Stars: ✭ 145 (-12.12%)
Fcis
Fully Convolutional Instance-aware Semantic Segmentation
Stars: ✭ 1,563 (+847.27%)
Clojurecuda
Clojure library for CUDA development
Stars: ✭ 158 (-4.24%)
Onemkl
oneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (-26.06%)
Hoomd Blue
Molecular dynamics and Monte Carlo soft matter simulation on GPUs.
Stars: ✭ 143 (-13.33%)
Knn cuda
Fast K-Nearest Neighbor search with GPU
Stars: ✭ 119 (-27.88%)
Compactcnncascade
A binary library for very fast face detection using compact CNNs.
Stars: ✭ 152 (-7.88%)
Spoc
Stream Processing with OCaml
Stars: ✭ 115 (-30.3%)
Forward
A library for high performance deep learning inference on NVIDIA GPUs.
Stars: ✭ 136 (-17.58%)
Cltune
CLTune: An automatic OpenCL & CUDA kernel tuner
Stars: ✭ 114 (-30.91%)
Cx db8
a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (Bert, Universal Sentence Encoder, Flair)
Stars: ✭ 164 (-0.61%)
Tensorflow Object Detection Tutorial
The purpose of this tutorial is to learn how to install and prepare TensorFlow framework to train your own convolutional neural network object detection classifier for multiple objects, starting from scratch
Stars: ✭ 113 (-31.52%)
Nsimd
Agenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (-16.36%)
Adacof Pytorch
Official source code for our paper "AdaCoF: Adaptive Collaboration of Flows for Video Frame Interpolation" (CVPR 2020)
Stars: ✭ 110 (-33.33%)
Cuhe
CUDA Homomorphic Encryption Library
Stars: ✭ 109 (-33.94%)
Spanet
Spatial Attentive Single-Image Deraining with a High Quality Real Rain Dataset (CVPR'19)
Stars: ✭ 136 (-17.58%)
Hashcat
World's fastest and most advanced password recovery utility
Stars: ✭ 11,014 (+6575.15%)
Pygraphistry
PyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer
Stars: ✭ 1,365 (+727.27%)
Cuda Cnn
CNN accelerated by CUDA. Tested on MNIST, finally reaching 99.76%.
Stars: ✭ 148 (-10.3%)
Nnvm
No description or website provided.
Stars: ✭ 1,639 (+893.33%)
Primitiv
A Neural Network Toolkit.
Stars: ✭ 164 (-0.61%)
Khiva
An open-source library of algorithms to analyse time series in GPU and CPU.
Stars: ✭ 161 (-2.42%)
Rmm
RAPIDS Memory Manager
Stars: ✭ 154 (-6.67%)
Sketchgraphs
A dataset of 15 million CAD sketches with geometric constraint graphs.
Stars: ✭ 148 (-10.3%)
Libcudacxx
The C++ Standard Library for your entire system.
Stars: ✭ 1,861 (+1027.88%)