
quic / Aimet

License: BSD 3-Clause
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

Programming Languages

Python
139,335 projects; #7 most used programming language

Projects that are alternatives to or similar to Aimet

SSD-Pruning-and-quantization
Pruning and quantization for SSD. Model compression.
Stars: ✭ 19 (-95.81%)
Mutual labels:  compression, pruning, quantization
Model Optimization
A toolkit to optimize ML models for deployment with Keras and TensorFlow, including quantization and pruning.
Stars: ✭ 992 (+118.98%)
Mutual labels:  quantization, pruning, compression
Nncf
PyTorch*-based Neural Network Compression Framework for enhanced OpenVINO™ inference
Stars: ✭ 218 (-51.88%)
Mutual labels:  quantization, pruning, compression
Distiller
Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller
Stars: ✭ 3,760 (+730.02%)
Mutual labels:  deep-neural-networks, quantization, pruning
Awesome Emdl
Embedded and mobile deep learning research resources
Stars: ✭ 554 (+22.3%)
Mutual labels:  deep-neural-networks, quantization, pruning
fasterai1
FasterAI: A repository for making smaller and faster models with the FastAI library.
Stars: ✭ 34 (-92.49%)
Mutual labels:  compression, pruning
ATMC
[NeurIPS'2019] Shupeng Gui, Haotao Wang, Haichuan Yang, Chen Yu, Zhangyang Wang, Ji Liu, “Model Compression with Adversarial Robustness: A Unified Optimization Framework”
Stars: ✭ 41 (-90.95%)
Mutual labels:  pruning, quantization
bert-squeeze
🛠️ Tools for Transformers compression using PyTorch Lightning ⚡
Stars: ✭ 56 (-87.64%)
Mutual labels:  pruning, quantization
sparsify
Easy-to-use UI for automatically sparsifying neural networks and creating sparsification recipes for better inference performance and a smaller footprint
Stars: ✭ 138 (-69.54%)
Mutual labels:  pruning, quantization
Video Hub App
Official repository for Video Hub App
Stars: ✭ 272 (-39.96%)
Mutual labels:  open-source, opensource
Online
Collabora Online is a collaborative online office suite based on LibreOffice technology. This is also the source for the Collabora Office apps for iOS and Android.
Stars: ✭ 278 (-38.63%)
Mutual labels:  open-source, opensource
neural-compressor
Intel® Neural Compressor (formerly Intel® Low Precision Optimization Tool) aims to provide unified APIs for network compression technologies, such as low-precision quantization, sparsity, pruning, and knowledge distillation, across different deep learning frameworks to pursue optimal inference performance.
Stars: ✭ 666 (+47.02%)
Mutual labels:  pruning, quantization
torch-model-compression
An automated toolset for analyzing and modifying the structure of PyTorch models, including a library of model compression algorithms with automatic model structure analysis.
Stars: ✭ 126 (-72.19%)
Mutual labels:  pruning, quantization
DNNAC
All about acceleration and compression of Deep Neural Networks
Stars: ✭ 29 (-93.6%)
Mutual labels:  compression, quantization
sparsezoo
Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes
Stars: ✭ 264 (-41.72%)
Mutual labels:  pruning, quantization
torchprune
A research library for pytorch-based neural network pruning, compression, and more.
Stars: ✭ 133 (-70.64%)
Mutual labels:  compression, pruning
Open Pixel Art
A collaborative pixel art project to teach people how to contribute to open-source
Stars: ✭ 331 (-26.93%)
Mutual labels:  open-source, opensource
Caffe
Caffe for Sparse and Low-rank Deep Neural Networks
Stars: ✭ 339 (-25.17%)
Mutual labels:  deep-neural-networks, compression
Freedomotic
Open IoT Framework
Stars: ✭ 354 (-21.85%)
Mutual labels:  open-source, opensource
Kafka Ui
Open-Source Web GUI for Apache Kafka Management
Stars: ✭ 230 (-49.23%)
Mutual labels:  open-source, opensource

Qualcomm Innovation Center, Inc. | AIMET on GitHub Pages | Discussion Forums

AI Model Efficiency Toolkit (AIMET)

AIMET is a library that provides advanced model quantization and compression techniques for trained neural network models. It provides features that have been proven to improve the run-time performance of deep learning models, with lower compute and memory requirements and minimal impact on task accuracy.

How AIMET works

AIMET is designed to work with PyTorch and TensorFlow models.

We also host the AIMET Model Zoo, a collection of popular neural network models optimized for 8-bit inference, along with recipes for quantizing floating-point models using AIMET.

Why AIMET?

Benefits of AIMET

  • Supports advanced quantization techniques: Inference using integer runtimes is significantly faster than inference using floating-point runtimes. For example, models run 5x-15x faster on the Qualcomm Hexagon DSP than on the Qualcomm Kryo CPU, and 8-bit models have a 4x smaller footprint than 32-bit models. However, maintaining model accuracy when quantizing ML models is often challenging. AIMET addresses this with novel techniques such as Data-Free Quantization, which provides state-of-the-art INT8 results on several popular models.
  • Supports advanced model compression techniques that enable models to run faster at inference time and require less memory.
  • Automates the optimization of neural networks, avoiding time-consuming and tedious manual tweaking, and provides user-friendly APIs that can be called directly from TensorFlow or PyTorch pipelines.

Please visit AIMET on GitHub Pages for more details.

Supported Features

Quantization

  • Cross-Layer Equalization: Equalizes weight tensors to reduce amplitude variation across channels
  • Bias Correction: Corrects the shift in layer outputs introduced by quantization
  • Quantization Simulation: Simulates on-target quantized inference accuracy (see the sketch after this list)
  • Fine-tuning: Uses quantization simulation to train the model further and improve accuracy
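
To make these features concrete, here is a minimal sketch of quantization simulation and encoding calibration with AIMET's PyTorch API. The class and argument names follow the aimet_torch package but vary somewhat across releases, so treat this as an illustration rather than a definitive recipe.

    # Minimal quantization-simulation sketch (assumes aimet_torch is installed;
    # exact signatures vary across AIMET releases).
    import torch
    from torchvision import models
    from aimet_torch.quantsim import QuantizationSimModel

    model = models.resnet18().eval()  # stand-in: use your own trained model
    dummy_input = torch.randn(1, 3, 224, 224)

    # Wrap the model with simulated 8-bit quantization ops for weights and
    # activations.
    sim = QuantizationSimModel(model, dummy_input=dummy_input,
                               default_param_bw=8, default_output_bw=8)

    # Calibrate quantization encodings by running representative data through
    # the simulated model (use a real calibration set in practice).
    def forward_pass(sim_model, _):
        with torch.no_grad():
            sim_model(dummy_input)

    sim.compute_encodings(forward_pass, forward_pass_callback_args=None)

    # sim.model now behaves like a regular PyTorch model: evaluate it to
    # estimate on-target accuracy, or keep training it to fine-tune.

Fine-tuning in this flow is ordinary training on sim.model; the simulated quantization ops stay in the graph, which is what lets training recover the accuracy lost to quantization.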

Model Compression

  • Spatial SVD: A tensor decomposition technique that splits a large layer into two smaller ones
  • Channel Pruning: Removes redundant input channels from a layer and reconstructs the layer weights
  • Per-layer compression-ratio selection: Automatically selects how much to compress each layer in the model (a usage sketch follows this list)
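
As a rough illustration of how these pieces fit together, below is a hedged sketch of auto-mode spatial SVD compression using the ModelCompressor entry point from aimet_torch. The parameter classes and signatures here follow AIMET's documented API but differ between releases, so check the docs for your version.

    # Hedged sketch of automatic spatial SVD compression; names follow the
    # aimet_torch API but may vary by release.
    from decimal import Decimal
    from torchvision import models
    from aimet_common.defs import (CompressionScheme, CostMetric,
                                   GreedySelectionParameters)
    from aimet_torch.defs import SpatialSvdParameters
    from aimet_torch.compress import ModelCompressor

    model = models.resnet18().eval()  # stand-in: use your own trained model

    def eval_callback(model, iterations, use_cuda):
        # Must return a float accuracy score measured on a validation set;
        # stubbed out here for brevity.
        return 0.5

    # Greedy per-layer ratio selection targeting ~50% of the original MACs.
    greedy = GreedySelectionParameters(target_comp_ratio=Decimal('0.5'),
                                       num_comp_ratio_candidates=10)
    params = SpatialSvdParameters(SpatialSvdParameters.Mode.auto,
                                  SpatialSvdParameters.AutoModeParams(greedy))

    compressed_model, stats = ModelCompressor.compress_model(
        model,
        eval_callback=eval_callback,
        eval_iterations=10,
        input_shape=(1, 3, 224, 224),
        compress_scheme=CompressionScheme.spatial_svd,
        cost_metric=CostMetric.mac,
        parameters=params)

Using CostMetric.mac steers the per-layer ratio search toward a MAC-reduction target, which is the operating point reported in the Results section below.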

Visualization

  • Weight ranges: Visually inspect whether a model is a candidate for the Cross-Layer Equalization technique, and the effect of applying it (see the sketch after this list)
  • Per-layer compression sensitivity: Get visual feedback on how sensitive each layer in the model is to compression
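
A short, hedged sketch of the weight-range visualization follows. The helper name comes from the aimet_torch.visualize_model module in AIMET's documentation, but its second argument has changed between releases (a Bokeh server URL in older versions, an output directory in newer ones), so verify against the docs for your install.

    # Hedged weight-range visualization sketch; the output argument differs
    # across AIMET releases.
    from torchvision import models
    from aimet_torch import visualize_model

    model = models.resnet18().eval()  # stand-in: use your own trained model

    # Plot per-channel weight ranges; a large spread across channels of the
    # same layer suggests Cross-Layer Equalization is likely to help.
    visualize_model.visualize_weight_ranges(model, "./aimet_plots")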

Results

AIMET can quantize an existing 32-bit floating-point model to an 8-bit fixed-point model without sacrificing much accuracy and without model fine-tuning. For example, the Data-Free Quantization (DFQ) method applied to several popular networks, such as MobileNet-v2 and ResNet-50, results in less than 0.9% loss in accuracy all the way down to 8-bit quantization, in an automated way and without any training data (a code sketch follows the table below).

Model                 FP32     INT8 (simulated)
MobileNet-v2 (top-1)  71.72%   71.08%
ResNet-50 (top-1)     76.05%   75.45%
DeepLab-v3 (mIoU)     72.65%   71.91%
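
The data-free part of this pipeline needs nothing beyond the model and an input shape. Here is a minimal sketch using the cross-layer-equalization entry point from aimet_torch; bias correction, the other half of DFQ, has its own API that has changed across releases and is omitted here.

    # Minimal data-free sketch: cross-layer equalization requires no
    # training data. equalize_model folds batch norms, equalizes weight
    # ranges across consecutive layers, and absorbs high biases, in place.
    from torchvision import models
    from aimet_torch.cross_layer_equalization import equalize_model

    model = models.mobilenet_v2().eval()  # stand-in: use your own trained model
    equalize_model(model, input_shapes=(1, 3, 224, 224))

After equalization, the model can be passed to the quantization-simulation flow shown earlier to measure INT8 accuracy.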

AIMET can also significantly compress models. For popular models such as ResNet-50 and ResNet-18, compression with spatial SVD plus channel pruning achieves a 50% MAC (multiply-accumulate) reduction while retaining accuracy within approximately 1% of the original uncompressed model.

Model              Uncompressed  50% compressed
ResNet-18 (top-1)  69.76%        68.56%
ResNet-50 (top-1)  76.05%        75.75%

Getting Started

Please see AIMET on GitHub Pages for installation and usage instructions.

Contributions

Thanks for your interest in contributing to AIMET! Please read our Contributions Page for more information on contributing features or bug fixes. We look forward to your participation!

Team

AIMET aims to be a community-driven project maintained by Qualcomm Innovation Center, Inc.

License

AIMET is licensed under the BSD 3-clause “New” or “Revised” License. Check out the LICENSE for more details.
