Tensorflow Object Detection TutorialThe purpose of this tutorial is to learn how to install and prepare TensorFlow framework to train your own convolutional neural network object detection classifier for multiple objects, starting from scratch
Stars: ✭ 113 (-26.14%)
MixbenchA GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL)
Stars: ✭ 130 (-15.03%)
SpocStream Processing with OCaml
Stars: ✭ 115 (-24.84%)
Region ConvNot All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade
Stars: ✭ 95 (-37.91%)
CuheCUDA Homomorphic Encryption Library
Stars: ✭ 109 (-28.76%)
Hoomd BlueMolecular dynamics and Monte Carlo soft matter simulation on GPUs.
Stars: ✭ 143 (-6.54%)
DppDetail-Preserving Pooling in Deep Networks (CVPR 2018)
Stars: ✭ 99 (-35.29%)
Kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+7188.24%)
Knn cudaFast K-Nearest Neighbor search with GPU
Stars: ✭ 119 (-22.22%)
ElasticfusionReal-time dense visual SLAM system
Stars: ✭ 1,298 (+748.37%)
SpanetSpatial Attentive Single-Image Deraining with a High Quality Real Rain Dataset (CVPR'19)
Stars: ✭ 136 (-11.11%)
CltuneCLTune: An automatic OpenCL & CUDA kernel tuner
Stars: ✭ 114 (-25.49%)
GpurirPython library for Room Impulse Response (RIR) simulation with GPU acceleration
Stars: ✭ 145 (-5.23%)
Adacof PytorchOfficial source code for our paper "AdaCoF: Adaptive Collaboration of Flows for Video Frame Interpolation" (CVPR 2020)
Stars: ✭ 110 (-28.1%)
NnvmNo description or website provided.
Stars: ✭ 1,639 (+971.24%)
HashcatWorld's fastest and most advanced password recovery utility
Stars: ✭ 11,014 (+7098.69%)
Cuda CnnCNN accelerated by cuda. Test on mnist and finilly get 99.76%
Stars: ✭ 148 (-3.27%)
PygraphistryPyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer
Stars: ✭ 1,365 (+792.16%)
SupraSUPRA: Software Defined Ultrasound Processing for Real-Time Applications - An Open Source 2D and 3D Pipeline from Beamforming to B-Mode
Stars: ✭ 96 (-37.25%)
ForwardA library for high performance deep learning inference on NVIDIA GPUs.
Stars: ✭ 136 (-11.11%)
NumerNumeric Erlang - vector and matrix operations with CUDA. Heavily inspired by Pteracuda - https://github.com/kevsmith/pteracuda
Stars: ✭ 91 (-40.52%)
FcisFully Convolutional Instance-aware Semantic Segmentation
Stars: ✭ 1,563 (+921.57%)
BabelstreamSTREAM, for lots of devices written in many programming models
Stars: ✭ 121 (-20.92%)
MatconvnetMatConvNet: CNNs for MATLAB
Stars: ✭ 1,299 (+749.02%)
Marian DevFast Neural Machine Translation in C++ - development repository
Stars: ✭ 136 (-11.11%)
Tensorflow Optimized WheelsTensorFlow wheels built for latest CUDA/CuDNN and enabled performance flags: SSE, AVX, FMA; XLA
Stars: ✭ 118 (-22.88%)
MtensorA C++ Cuda Tensor Lazy Computing Library
Stars: ✭ 115 (-24.84%)
Partial Order PruningPartial Order Pruning: for Best Speed/Accuracy Trade-off in Neural Architecture Search
Stars: ✭ 135 (-11.76%)
Pytorch spnExtension package for spatial propagation network in pytorch.
Stars: ✭ 114 (-25.49%)
GinkgoNumerical linear algebra software package
Stars: ✭ 149 (-2.61%)
Pytorch Unflow a reimplementation of UnFlow in PyTorch that matches the official TensorFlow version
Stars: ✭ 113 (-26.14%)
Futhark💥💻💥 A data-parallel functional programming language
Stars: ✭ 1,641 (+972.55%)
RemoterySingle C file, Realtime CPU/GPU Profiler with Remote Web Viewer
Stars: ✭ 1,908 (+1147.06%)
LibcudacxxThe C++ Standard Library for your entire system.
Stars: ✭ 1,861 (+1116.34%)
DaceDaCe - Data Centric Parallel Programming
Stars: ✭ 106 (-30.72%)
JetsonHelmut Hoffer von Ankershoffen experimenting with arm64 based NVIDIA Jetson (Nano and AGX Xavier) edge devices running Kubernetes (K8s) for machine learning (ML) including Jupyter Notebooks, TensorFlow Training and TensorFlow Serving using CUDA for smart IoT.
Stars: ✭ 151 (-1.31%)
Cuda WinogradFast CUDA Kernels for ResNet Inference.
Stars: ✭ 104 (-32.03%)
AgencyExecution primitives for C++
Stars: ✭ 127 (-16.99%)
DeepnetDeep.Net machine learning framework for F#
Stars: ✭ 99 (-35.29%)
Libgdf[ARCHIVED] C GPU DataFrame Library
Stars: ✭ 142 (-7.19%)
Extending JaxExtending JAX with custom C++ and CUDA code
Stars: ✭ 98 (-35.95%)
PynvvlA Python wrapper of NVIDIA Video Loader (NVVL) with CuPy for fast video loading with Python
Stars: ✭ 95 (-37.91%)
SketchgraphsA dataset of 15 million CAD sketches with geometric constraint graphs.
Stars: ✭ 148 (-3.27%)
Fbtt EmbeddingThis is a Tensor Train based compression library to compress sparse embedding tables used in large-scale machine learning models such as recommendation and natural language processing. We showed this library can reduce the total model size by up to 100x in Facebook’s open sourced DLRM model while achieving same model quality. Our implementation is faster than the state-of-the-art implementations. Existing the state-of-the-art library also decompresses the whole embedding tables on the fly therefore they do not provide memory reduction during runtime of the training. Our library decompresses only the requested rows therefore can provide 10,000 times memory footprint reduction per embedding table. The library also includes a software cache to store a portion of the entries in the table in decompressed format for faster lookup and process.
Stars: ✭ 92 (-39.87%)
Ctranslate2Fast inference engine for OpenNMT models
Stars: ✭ 140 (-8.5%)
Warp RnntCUDA-Warp RNN-Transducer
Stars: ✭ 122 (-20.26%)
CompactcnncascadeA binary library for very fast face detection using compact CNNs.
Stars: ✭ 152 (-0.65%)
NsimdAgenium Scale vectorization library for CPUs and GPUs
Stars: ✭ 138 (-9.8%)
OnemkloneAPI Math Kernel Library (oneMKL) Interfaces
Stars: ✭ 122 (-20.26%)