Haq: [CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision
Blueoil: Bring deep learning to small devices
Nlp Architect: A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks
Nncf: PyTorch*-based Neural Network Compression Framework for enhanced OpenVINO™ inference
Lq Nets: LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks
Pytorch Playground: Base pretrained models and datasets in PyTorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeNet)
Kd Lib: A PyTorch knowledge distillation library for benchmarking and extending works in the domains of knowledge distillation, pruning, and quantization.
Terngrad: Ternary Gradients to Reduce Communication in Distributed Deep Learning (TensorFlow)
Zeroq: [CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework
Inq Pytorch: A PyTorch implementation of "Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights"
Awesome Edge Machine Learning: A curated list of awesome edge machine learning resources, including research papers, inference engines, challenges, books, meetups and more.
Graffitist: Graph transforms to quantize and retrain deep neural nets in TensorFlow
Dfq: PyTorch implementation of "Data-Free Quantization Through Weight Equalization and Bias Correction"
Tf2: An open-source deep learning inference engine based on FPGAs
Hawq: Quantization library for PyTorch. Supports low-precision and mixed-precision quantization, with hardware implementation through TVM.
Frostnet: FrostNet: Towards Quantization-Aware Network Architecture Search
Pyepr: Powerful, automated analysis and design of quantum microwave chips & devices [Energy-Participation Ratio and more]
Micronet: micronet, a model compression and deployment library. Compression: (1) quantization: quantization-aware training (QAT), both high-bit (>2b) methods (DoReFa; "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference") and low-bit (≤2b) ternary/binary methods (TWN/BNN/XNOR-Net), plus post-training quantization (PTQ) at 8 bits (TensorRT); (2) pruning: normal, regular, and group-convolution channel pruning; (3) group convolution structure; (4) batch-normalization fusion for quantization. Deployment: TensorRT with fp32/fp16/int8 (PTQ calibration), op adaptation (upsample), and dynamic shapes.
Vectorsinsearch: Dice.com repo accompanying the "Vectors in Search" talk by Simon Hughes at the Activate 2018 search conference and the "Searching with Vectors" talk from Haystack 2019 (US); builds upon his conceptual search and semantic search work from 2015.
Dsq: PyTorch implementation of "Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks"
Ntagger: Reference PyTorch code for named entity tagging
Jacinto Ai Devkit: Training & quantization of embedded-friendly deep learning / machine learning / computer vision models
Quantization.mxnet: Simulate quantization and quantization-aware training for MXNet-Gluon models.
Model Optimization: A toolkit to optimize ML models for deployment with Keras and TensorFlow, including quantization and pruning.
Sai: SDK for the TEE AI Stick (includes model training scripts, an inference library, and examples)
Awesome Automl And Lightweight Models: A list of high-quality (newest) AutoML works and lightweight models, including (1) neural architecture search, (2) lightweight structures, (3) model compression, quantization and acceleration, (4) hyperparameter optimization, and (5) automated feature engineering.
Paddleslim: PaddleSlim is an open-source library for deep model compression and architecture search.
Pinto Model Zoo: A repository sharing tuning results of trained models generated by TensorFlow / Keras. Post-training quantization (weight quantization, integer quantization, full-integer quantization, float16 quantization) and quantization-aware training. TensorFlow Lite, OpenVINO, CoreML, TensorFlow.js, TF-TRT, MediaPipe, ONNX. [.tflite, .h5, .pb, saved_model, tfjs, tftrt, mlmodel, .xml/.bin, .onnx]
Paddleclas: A treasure chest for image classification powered by PaddlePaddle
Awesome Emdl: Embedded and mobile deep learning research resources
Aimet: AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
Deephash: An open-source package for deep learning to hash (DeepHash)
Pngquant: Lossy PNG compressor (the pngquant command, based on the libimagequant library)
Brevitas: Quantization-aware training in PyTorch
Libimagequant: Palette quantization library that powers pngquant and other PNG optimizers
Distiller: Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller
Finn: Dataflow compiler for QNN inference on FPGAs
Qkeras: QKeras: a quantization deep learning library for TensorFlow Keras
Pretrained Language Model: Pretrained language models and related optimization techniques developed by Huawei Noah's Ark Lab.
sparsify: Easy-to-use UI for automatically sparsifying neural networks and creating sparsification recipes for better inference performance and a smaller footprint
quantize: 🎨 Simple color palette quantization using MMCQ
colorquant: Go library for color quantization and dithering
aaai17-cdq: Implementation of the AAAI-17 paper "Collective Deep Quantization for Efficient Cross-Modal Retrieval"
KGySoft.Drawing: KGy SOFT Drawing is a library for advanced image, icon, and graphics handling.
optimum: 🏎️ Accelerate training and inference of 🤗 Transformers with easy-to-use hardware optimization tools
ppq: PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
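Most of the neural network quantization tools above build on the same core primitive: uniform affine quantization, where a float tensor is mapped to low-bit integers via a scale and zero-point and then dequantized to simulate low-precision inference ("fake quantization"). A minimal NumPy sketch of that primitive, with illustrative names that do not belong to any library listed here:

```python
import numpy as np

def quantize_affine(x, num_bits=8):
    """Uniform affine (asymmetric) quantization of a float array.

    Maps x onto the integer grid [0, 2**num_bits - 1] using a scale
    and zero-point, then dequantizes back to floats so the rounding
    error of low-precision inference can be simulated ("fake quant").
    """
    qmin, qmax = 0, 2 ** num_bits - 1
    x_min, x_max = float(x.min()), float(x.max())
    # The representable range must include 0.0 so that zero maps exactly.
    x_min, x_max = min(x_min, 0.0), max(x_max, 0.0)
    scale = (x_max - x_min) / (qmax - qmin) or 1.0  # guard all-zero input
    zero_point = int(round(qmin - x_min / scale))
    q = np.clip(np.round(x / scale) + zero_point, qmin, qmax)
    return (q - zero_point) * scale  # dequantized approximation of x

x = np.array([-1.0, -0.5, 0.0, 0.5, 1.0])
x_q = quantize_affine(x, num_bits=8)
# Rounding error of each element is bounded by half the quantization step.
assert float(np.max(np.abs(x - x_q))) <= (2.0 / 255) / 2 + 1e-9
```

Post-training quantization tools (e.g. the PTQ modes mentioned for Micronet, Pinto Model Zoo, or ppq) estimate `x_min`/`x_max` from calibration data, while QAT libraries such as Brevitas or QKeras insert this fake-quant step into the forward pass so training can adapt to it.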