🍅🍅🍅YOLOv5-Lite: lighter, faster and easier to deploy. Evolved from YOLOv5, with a model size of only 930+ KB (int8) and 1.7M (fp16). It can reach 10+ FPS on the Raspberry Pi 4B at an input size of 320×320.

Stars: ✭ 1,230 (+192.16%)

Mutual labels: transformer, onnxruntime

torch-model-compression

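A toolset for automated structure analysis and modification of PyTorch models, including a library of model compression algorithms that analyze model structure automatically.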

Stars: ✭ 126 (-70.07%)

Mutual labels: quantization, onnx

deepvac

PyTorch Project Specification.

Stars: ✭ 507 (+20.43%)

Mutual labels: quantization, onnx

Pinto model zoo

A repository that shares tuning results of trained models generated by TensorFlow / Keras. Post-training quantization (Weight Quantization, Integer Quantization, Full Integer Quantization, Float16 Quantization), Quantization-aware training. TensorFlow Lite. OpenVINO. CoreML. TensorFlow.js. TF-TRT. MediaPipe. ONNX. [.tflite, .h5, .pb, saved_model, tfjs, tftrt, mlmodel, .xml/.bin, .onnx]

Stars: ✭ 634 (+50.59%)

Mutual labels: quantization, onnx

KitanaQA

KitanaQA: Adversarial training and data augmentation for neural question-answering models

Stars: ✭ 58 (-86.22%)

Mutual labels: transformer, question-answering

t5-japanese

Code to pre-train Japanese T5 models

Stars: ✭ 39 (-90.74%)

Mutual labels: transformer, t5

Micronet

micronet, a model compression and deployment library. Compression: (1) quantization: quantization-aware training (QAT), high-bit (>2b) (DoReFa / Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference), low-bit (≤2b) / ternary and binary (TWN/BNN/XNOR-Net); post-training quantization (PTQ), 8-bit (TensorRT); (2) pruning: normal, regular and group convolutional channel pruning; (3) group convolution structure; (4) batch-normalization fusion for quantization. Deployment: TensorRT, fp32/fp16/int8 (PTQ calibration), op-adapt (upsample), dynamic_shape.

Stars: ✭ 1,232 (+192.64%)

Mutual labels: quantization, onnx

Yolov5 Rt Stack

Yet another yolov5, with its runtime stack for libtorch, onnx, tvm and specialized accelerators. You like torchvision's retinanet? You like yolov5? You love yolort!

Stars: ✭ 107 (-74.58%)

Mutual labels: inference, onnx

serving-runtime

Exposes a serialized machine learning model through an HTTP API.

Stars: ✭ 15 (-96.44%)

Mutual labels: inference, onnx

ai-serving

Serving AI/ML models in the open standard formats PMML and ONNX with both HTTP (REST API) and gRPC endpoints

Stars: ✭ 122 (-71.02%)

Mutual labels: inference, onnx

ONNX-Runtime-with-TensorRT-and-OpenVINO

Docker scripts for building ONNX Runtime with TensorRT and OpenVINO in manylinux environment

Stars: ✭ 15 (-96.44%)

Mutual labels: onnx, onnxruntime

ONNX-HITNET-Stereo-Depth-estimation

Python scripts for performing stereo depth estimation using the HITNET model in ONNX.

Stars: ✭ 21 (-95.01%)

Mutual labels: onnx, onnxruntime

Distiller

Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller

Stars: ✭ 3,760 (+793.11%)

Mutual labels: quantization, onnx

Lightseq

LightSeq: A High Performance Inference Library for Sequence Processing and Generation

Stars: ✭ 501 (+19%)

Mutual labels: inference, transformer

Multi Model Server

Multi Model Server is a tool for serving neural net models for inference

Stars: ✭ 770 (+82.9%)

Mutual labels: inference, onnx

Rust Bert

Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)

Stars: ✭ 510 (+21.14%)

Mutual labels: transformer, question-answering

mediapipe plus

The purpose of this project is to apply mediapipe to more AI chips.

Stars: ✭ 38 (-90.97%)

Mutual labels: inference, onnx

Cubert

Fast implementation of BERT inference directly on NVIDIA (CUDA, CUBLAS) and Intel MKL

Stars: ✭ 395 (-6.18%)

Mutual labels: inference, transformer

Awesome Emdl

Embedded and mobile deep learning research resources

Stars: ✭ 554 (+31.59%)

Mutual labels: inference, quantization

Ml Model Ci

MLModelCI is a complete MLOps platform for managing, converting, profiling, and deploying MLaaS (Machine Learning-as-a-Service), bridging the gap between current ML training and serving systems.

Stars: ✭ 122 (-71.02%)

Mutual labels: inference, onnx

Ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

Stars: ✭ 13,376 (+3077.2%)

Mutual labels: inference, onnx

graphsignal

Graphsignal Python agent

Stars: ✭ 158 (-62.47%)

Mutual labels: inference, onnxruntime

ppq

PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

Stars: ✭ 281 (-33.25%)

Mutual labels: quantization, onnx

sparsify

Easy-to-use UI for automatically sparsifying neural networks and creating sparsification recipes for better inference performance and a smaller footprint

Stars: ✭ 138 (-67.22%)

Mutual labels: quantization, onnx

text2keywords

Trained T5 and T5-large models for creating keywords from text

Stars: ✭ 53 (-87.41%)

Mutual labels: transformer, t5

Mivisionx

MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimized open-source implementation of the Khronos OpenVX™ and OpenVX™ Extensions.

Stars: ✭ 100 (-76.25%)

Mutual labels: inference, onnx

image-classification

A collection of SOTA Image Classification Models in PyTorch

Stars: ✭ 70 (-83.37%)

Mutual labels: transformer, quantization

Volksdep

volksdep is an open-source toolbox for deploying and accelerating PyTorch, ONNX and TensorFlow models with TensorRT.

Stars: ✭ 195 (-53.68%)

Mutual labels: inference, onnx

ONNX-Mobile-Human-Pose-3D

Python scripts for performing 3D human pose estimation using the Mobile Human Pose model in ONNX.

Stars: ✭ 69 (-83.61%)

Mutual labels: onnx, onnxruntime

vs-mlrt

Efficient ML Filter Runtimes for VapourSynth (with built-in support for waifu2x, DPIR, RealESRGANv2, and Real-CUGAN)

Stars: ✭ 34 (-91.92%)

Mutual labels: onnx, onnxruntime

verseagility

Ramp up your custom natural language processing (NLP) task, allowing you to bring your own data, use your preferred frameworks and bring models into production.

Stars: ✭ 23 (-94.54%)

Mutual labels: transformer, question-answering

Turbotransformers

A fast and user-friendly runtime for transformer inference (BERT, ALBERT, GPT2, decoders, etc.) on CPU and GPU.

Stars: ✭ 826 (+96.2%)

Mutual labels: inference, transformer

Effective transformer

Running BERT without Padding

Stars: ✭ 169 (-59.86%)

Mutual labels: inference, transformer

Libonnx

A lightweight, portable pure C99 onnx inference engine for embedded devices with hardware acceleration support.

Stars: ✭ 217 (-48.46%)

Mutual labels: inference, onnx

unsupervised-qa

Template-Based Question Generation from Retrieved Sentences for Improved Unsupervised Question Answering

Stars: ✭ 47 (-88.84%)

Mutual labels: question-answering

pytorch YOLO OpenVINO demo

No description or website provided.

Stars: ✭ 73 (-82.66%)

Mutual labels: onnx

SegSwap

(CVPRW 2022) Learning Co-segmentation by Segment Swapping for Retrieval and Discovery

Stars: ✭ 46 (-89.07%)

Mutual labels: transformer

rankqa

This is the PyTorch implementation of the ACL 2019 paper RankQA: Neural Question Answering with Answer Re-Ranking.

Stars: ✭ 83 (-80.29%)

Mutual labels: question-answering

Awesome-low-level-vision-resources

A curated list of resources for Low-level Vision Tasks

Stars: ✭ 35 (-91.69%)

Mutual labels: transformer

MASTER-pytorch

Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)

Stars: ✭ 263 (-37.53%)

Mutual labels: transformer

websocket

WebSocket for fasthttp

Stars: ✭ 51 (-87.89%)

Mutual labels: fast

BMT

Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)

Stars: ✭ 192 (-54.39%)

Mutual labels: transformer

ZAQ-code

CVPR 2021 : Zero-shot Adversarial Quantization (ZAQ)

Stars: ✭ 59 (-85.99%)

Mutual labels: quantization

sb-nmt

Code for Synchronous Bidirectional Neural Machine Translation (SB-NMT)

Stars: ✭ 66 (-84.32%)

Mutual labels: transformer

MashaRoBot

MashaRoBot : 📑Editor's choice

Stars: ✭ 39 (-90.74%)

Mutual labels: fast

hcrn-videoqa

Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)

Stars: ✭ 111 (-73.63%)

Mutual labels: question-answering

examinee

Laravel quiz and exam system, a clone of Udemy

Stars: ✭ 151 (-64.13%)

Mutual labels: question-answering

TianChi AIEarth

TianChi AIEarth Contest Solution

Stars: ✭ 57 (-86.46%)

Mutual labels: transformer

Transformers-RL

An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"

Stars: ✭ 107 (-74.58%)

Mutual labels: transformer

object-flaw-detector-cpp

Detect various irregularities of a product as it moves along a conveyor belt.

Stars: ✭ 19 (-95.49%)

Mutual labels: inference

DFPlayerMini Fast

Fast and easy to understand Arduino library to use the DFPlayer Mini MP3 module from DFRobot.com. This is a huge improvement (both in terms of execution speed and simplicity) to the standard library provided by DFRobot.com.

Stars: ✭ 164 (-61.05%)

Mutual labels: fast

muparsersse

muparsersse: a math parser for Windows that uses just-in-time compilation of the expression

Stars: ✭ 14 (-96.67%)

Mutual labels: fast

seq2seq-pytorch

Sequence to Sequence Models in PyTorch

Stars: ✭ 41 (-90.26%)

Mutual labels: transformer

fast-speedtest-api

fast.com API / CLI tool