fastT5⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.
Stars: ✭ 421 (-25.75%)
onnxruntime-rsRust wrapper for Microsoft's ONNX Runtime (version 1.8)
Stars: ✭ 149 (-73.72%)
concurrent-video-analytic-pipeline-optimization-sample-lCreate a concurrent video analysis pipeline featuring multistream face and human pose detection, vehicle attribute detection, and the ability to encode multiple videos to local storage in a single stream.
Stars: ✭ 39 (-93.12%)
LibonnxA lightweight, portable pure C99 onnx inference engine for embedded devices with hardware acceleration support.
Stars: ✭ 217 (-61.73%)
Nlp ArchitectA model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks
Stars: ✭ 2,768 (+388.18%)
vs-mlrtEfficient ML Filter Runtimes for VapourSynth (with built-in support for waifu2x, DPIR, RealESRGANv2, and Real-CUGAN)
Stars: ✭ 34 (-94%)
sparsifyEasy-to-use UI for automatically sparsifying neural networks and creating sparsification recipes for better inference performance and a smaller footprint
Stars: ✭ 138 (-75.66%)
Tf2An Open Source Deep Learning Inference Engine Based on FPGA
Stars: ✭ 113 (-80.07%)
Ml Model CiMLModelCI is a complete MLOps platform for managing, converting, profiling, and deploying MLaaS (Machine Learning-as-a-Service), bridging the gap between current ML training and serving systems.
Stars: ✭ 122 (-78.48%)
studio-lab-examplesExample notebooks for working with SageMaker Studio Lab. Sign up for an account at the link below!
Stars: ✭ 319 (-43.74%)
sagemaker-xgboost-containerThis is the Docker container based on open source framework XGBoost (https://xgboost.readthedocs.io/en/latest/) to allow customers use their own XGBoost scripts in SageMaker.
Stars: ✭ 93 (-83.6%)
ai-servingServing AI/ML models in the open standard formats PMML and ONNX with both HTTP (REST API) and gRPC endpoints
Stars: ✭ 122 (-78.48%)
popartPoplar Advanced Runtime for the IPU
Stars: ✭ 62 (-89.07%)
Multi Model ServerMulti Model Server is a tool for serving neural net models for inference
Stars: ✭ 770 (+35.8%)
MivisionxMIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimized open-source implementation of the Khronos OpenVX™ and OpenVX™ Extensions.
Stars: ✭ 100 (-82.36%)
Volksdepvolksdep is an open-source toolbox for deploying and accelerating PyTorch, ONNX and TensorFlow models with TensorRT.
Stars: ✭ 195 (-65.61%)
Pinto model zooA repository that shares tuning results of trained models generated by TensorFlow / Keras. Post-training quantization (Weight Quantization, Integer Quantization, Full Integer Quantization, Float16 Quantization), Quantization-aware training. TensorFlow Lite. OpenVINO. CoreML. TensorFlow.js. TF-TRT. MediaPipe. ONNX. [.tflite,.h5,.pb,saved_model,tfjs,tftrt,mlmodel,.xml/.bin, .onnx]
Stars: ✭ 634 (+11.82%)
Model OptimizationA toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
Stars: ✭ 992 (+74.96%)
Awesome System For Machine LearningA curated list of research in machine learning system. I also summarize some papers if I think they are really interesting.
Stars: ✭ 1,185 (+108.99%)
object-flaw-detector-cppDetect various irregularities of a product as it moves along a conveyor belt.
Stars: ✭ 19 (-96.65%)
Bmw Labeltool LiteThis repository provides you with a easy to use labeling tool for State-of-the-art Deep Learning training purposes.
Stars: ✭ 145 (-74.43%)
intruder-detector-pythonBuild an application that alerts you when someone enters a restricted area. Learn how to use models for multiclass object detection.
Stars: ✭ 16 (-97.18%)
object-size-detector-pythonMonitor mechanical bolts as they move down a conveyor belt. When a bolt of an irregular size is detected, this solution emits an alert.
Stars: ✭ 26 (-95.41%)
motor-defect-detector-pythonPredict performance issues with manufacturing equipment motors. Perform local or cloud analytics of the issues found, and then display the data on a user interface to determine when failures might arise.
Stars: ✭ 24 (-95.77%)
mediapipe plusThe purpose of this project is to apply mediapipe to more AI chips.
Stars: ✭ 38 (-93.3%)
chainer-fcis[This project has moved to ChainerCV] Chainer Implementation of Fully Convolutional Instance-aware Semantic Segmentation
Stars: ✭ 45 (-92.06%)
ONNX-Mobile-Human-Pose-3DPython scripts for performing 3D human pose estimation using the Mobile Human Pose model in ONNX.
Stars: ✭ 69 (-87.83%)
DistillerNeural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller
Stars: ✭ 3,760 (+563.14%)
Dawn Bench EntriesDAWNBench: An End-to-End Deep Learning Benchmark and Competition
Stars: ✭ 254 (-55.2%)
Awesome EmdlEmbedded and mobile deep learning research resources
Stars: ✭ 554 (-2.29%)
serving-runtimeExposes a serialized machine learning model through a HTTP API.
Stars: ✭ 15 (-97.35%)
Yolov5 Rt StackYet another yolov5, with its runtime stack for libtorch, onnx, tvm and specialized accelerators. You like torchvision's retinanet? You like yolov5? You love yolort!
Stars: ✭ 107 (-81.13%)
Micronetmicronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference)、Low-Bit(≤2b)/Ternary and Binary(TWN/BNN/XNOR-Net); post-training-quantization(PTQ), 8-bit(tensorrt); 2、 pruning: normal、regular and group convolutional channel pruning; 3、 group convolution structure; 4、batch-normalization fuse for quantization. deploy: tensorrt, fp32/fp16/int8(ptq-calibration)、op-adapt(upsample)、dynamic_shape
Stars: ✭ 1,232 (+117.28%)
Ncnnncnn is a high-performance neural network inference framework optimized for the mobile platform
Stars: ✭ 13,376 (+2259.08%)
Onnxt5Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.
Stars: ✭ 143 (-74.78%)
ppqPPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
Stars: ✭ 281 (-50.44%)
graphsignalGraphsignal Python agent
Stars: ✭ 158 (-72.13%)
yaskYASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-difference methods and similar applications.
Stars: ✭ 81 (-85.71%)
People Counter PythonCreate a smart video application using the Intel Distribution of OpenVINO toolkit. The toolkit uses models and inference to run single-class object detection.
Stars: ✭ 62 (-89.07%)
Amazon Sagemaker ExamplesExample 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
Stars: ✭ 6,346 (+1019.22%)
openvino pytorch layersHow to export PyTorch models with unsupported layers to ONNX and then to Intel OpenVINO
Stars: ✭ 17 (-97%)
deepvacPyTorch Project Specification.
Stars: ✭ 507 (-10.58%)
bert-squeeze🛠️ Tools for Transformers compression using PyTorch Lightning ⚡
Stars: ✭ 56 (-90.12%)
safety-gear-detector-pythonObserve workers as they pass in front of a camera to determine if they have adequate safety protection.
Stars: ✭ 54 (-90.48%)
LaplacianOpt.jlA Julia/JuMP Package for Maximizing Algebraic Connectivity of Undirected Weighted Graphs
Stars: ✭ 16 (-97.18%)
SynapseMLSimple and Distributed Machine Learning
Stars: ✭ 3,355 (+491.71%)
LBFGS-LiteLBFGS-Lite: A header-only L-BFGS unconstrained optimizer.
Stars: ✭ 98 (-82.72%)
ddcpuid🔬 dd's x86 CPU Identification tool
Stars: ✭ 21 (-96.3%)
mysql tuning-cookbookChef cookbook to create MySQL configuraiton files better suited for your system.
Stars: ✭ 23 (-95.94%)