Silya-1 / fast-fomm-mobile

Licence: MIT license
Compressing First Order Motion Model for Image Animation to enable its real-time inference on mobile devices

Fast-FOMM-Mobile

The code was written by Arman Tsaturyan, Nikita Mokrov, Ilya Selnitskiy and Ilya Zakharkin.

About

The purpose of the project was to compress the First-Order Motion Model for the conditional image generation task to enable real-time inference on mobile devices. Inspired by three recent works, First-Order Motion Model (FOMM), GAN Compression and StyleGAN2 Distillation, we came up with an approach that we call 2pix2pix. The main idea is to gather a distilled dataset from FOMM predictions, then train a pix2pix generator that takes two images, "source" and "driving", and predicts the "target" image. The loss is computed against the original FOMM predictions, so the aim of 2pix2pix is to match FOMM's output as closely as possible given exactly the same input. We also benchmark all models used (FOMM, the original pix2pix and the compressed pix2pix) on CPU, GPU and mobile processors.
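The distillation objective can be sketched without any deep learning framework. This is only an illustration, not the project's training code: images are flattened lists of floats, `toy_student` is a hypothetical stand-in for the pix2pix generator, and the L1 distance is an assumption, since the text above does not name the actual loss.

```python
from typing import Callable, List, Sequence

Image = List[float]  # a flattened image; real code would use tensors


def concat_inputs(source: Image, driving: Image) -> Image:
    """Join the two conditioning images into a single generator input.

    Assumption: the inputs are simply concatenated (channel-wise in a
    real tensor layout).
    """
    return list(source) + list(driving)


def l1_distance(prediction: Sequence[float], target: Sequence[float]) -> float:
    """Mean absolute difference between two equally sized images."""
    if len(prediction) != len(target) or not target:
        raise ValueError("prediction and target must be non-empty and equal in size")
    return sum(abs(p - t) for p, t in zip(prediction, target)) / len(target)


def distillation_loss(
    student: Callable[[Image], Image],
    source: Image,
    driving: Image,
    fomm_prediction: Image,
) -> float:
    """Compare the student's output with the teacher (FOMM) prediction."""
    student_output = student(concat_inputs(source, driving))
    return l1_distance(student_output, fomm_prediction)


def toy_student(x: Image) -> Image:
    """Hypothetical student: averages the source and driving halves."""
    half = len(x) // 2
    return [(a + b) / 2 for a, b in zip(x[:half], x[half:])]


source = [0.0, 0.5, 1.0]
driving = [1.0, 0.5, 0.0]
fomm_prediction = [0.5, 0.5, 1.0]  # stored teacher output from the distilled dataset
loss = distillation_loss(toy_student, source, driving, fomm_prediction)
```

The teacher prediction is read from the distilled dataset rather than recomputed, so FOMM itself does not have to run during student training.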

Video

Please see the video of the project presentation for more details.

Prerequisites

  • Linux or macOS
  • Python 3
  • CPU or NVIDIA GPU + CUDA

Description

First Order Motion Model

This is a fork of the original FOMM model. We added a script, generation_syntetic_dataset_v3_recognition.py, for creating a synthetic dataset that consists of triplets: (source image, driving image, FOMM-predicted image). The FOMM-predicted image is the source image transformed by FOMM into the pose of the driving image. An example of such a triplet is presented below. Here is a link to the created dataset.
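One way to picture the triplet structure is the sketch below, which groups files into (source, driving, predicted) records. The folder names and the convention that the three files of a triplet share a file stem are hypothetical; the actual layout produced by the script is not described here.

```python
from pathlib import PurePosixPath
from typing import Dict, Iterable, List, NamedTuple


class Triplet(NamedTuple):
    source: str      # image to be animated
    driving: str     # image providing the target pose
    predicted: str   # FOMM output for (source, driving)


def _by_stem(paths: Iterable[str]) -> Dict[str, str]:
    """Index paths by file name without extension, e.g. '0001'."""
    return {PurePosixPath(p).stem: p for p in paths}


def build_triplets(
    source_files: Iterable[str],
    driving_files: Iterable[str],
    predicted_files: Iterable[str],
) -> List[Triplet]:
    """Keep only the samples for which all three images exist."""
    sources = _by_stem(source_files)
    drivings = _by_stem(driving_files)
    predictions = _by_stem(predicted_files)
    common = sorted(set(sources) & set(drivings) & set(predictions))
    return [Triplet(sources[k], drivings[k], predictions[k]) for k in common]


triplets = build_triplets(
    ["source/0001.png", "source/0002.png"],
    ["driving/0001.png", "driving/0002.png"],
    ["predicted/0001.png"],  # 0002 has no FOMM prediction yet, so it is skipped
)
```

Dropping incomplete samples at indexing time keeps a dataloader built on top of this list from failing midway through an epoch.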

GAN Compression

The module is also a fork of the original GAN Compression model. Here we added several important improvements:

  1. Triplet dataloader for pix2pix model;
  2. Dense Motion block inside of the pix2pix;
  3. CoordConv block.
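As background for the third item, CoordConv appends coordinate channels to a feature map before a convolution, so that filters can see where each pixel lies. The framework-free sketch below shows only the coordinate-channel step on nested lists; the function names are illustrative and it is not the block used in this repository.

```python
from typing import List, Tuple

Channel = List[List[float]]  # one channel: rows of pixel values


def linspace_unit(n: int) -> List[float]:
    """n evenly spaced values from -1 to 1 (a single value maps to 0)."""
    if n == 1:
        return [0.0]
    return [2.0 * i / (n - 1) - 1.0 for i in range(n)]


def coord_channels(height: int, width: int) -> Tuple[Channel, Channel]:
    """Build the x and y coordinate channels for a height x width map."""
    xs = linspace_unit(width)
    ys = linspace_unit(height)
    x_channel = [list(xs) for _ in range(height)]  # varies along columns
    y_channel = [[y] * width for y in ys]          # varies along rows
    return x_channel, y_channel


def add_coord_channels(image: List[Channel]) -> List[Channel]:
    """Return the image with two extra channels holding pixel coordinates."""
    height, width = len(image[0]), len(image[0][0])
    x_channel, y_channel = coord_channels(height, width)
    return list(image) + [x_channel, y_channel]


image = [[[0.0] * 3 for _ in range(2)]]  # 1 channel, 2 rows, 3 columns
augmented = add_coord_channels(image)
```

In a real network, the augmented map would then be passed to an ordinary convolution layer with two additional input channels.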

Example of the Dense Motion block prediction for random images during training:

See a demo notebook to launch our model.

ONNX to Core ML Converter

This submodule is a fork of ONNX to Core ML Converter. The module is aimed at converting PyTorch models to the Apple Core ML format, which is used for model inference on Apple devices. There is no straightforward way to convert PyTorch models directly to Core ML, so an intermediate conversion to the ONNX format is used.

Have a look at the GIFs in the pics folder.
