Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → omertov → Encoder4editing

omertov / Encoder4editing

Licence: mit

Official implementation of "Desinging an Encoder for StyleGAN Image Manipulation" https://arxiv.org/abs/2102.02766

Labels

jupyter-notebook encoder

Projects that are alternatives of or similar to Encoder4editing

Tutorial Arima W Jeffrey Yau

Stars: ✭ 90 (-1.1%)

Mutual labels: jupyter-notebook

Starcraft2 Replay Analysis

A jupyter notebook that provides analysis for StarCraft 2 replays

Stars: ✭ 90 (-1.1%)

Mutual labels: jupyter-notebook

Music Generation

An RNN implementation to generate music

Stars: ✭ 90 (-1.1%)

Mutual labels: jupyter-notebook

A structured collection of tagged notes about machine learning theory and practice endowed with search infrastructure that allows users to read requested info only.

Stars: ✭ 90 (-1.1%)

Mutual labels: jupyter-notebook

Fashion recommender system using deep learning

Stars: ✭ 90 (-1.1%)

Mutual labels: jupyter-notebook

Python For Signal Processing

Notebooks for "Python for Signal Processing" book

Stars: ✭ 1,296 (+1324.18%)

Mutual labels: jupyter-notebook

Machine Learning Numpy

Gathers Machine learning models using pure Numpy to cover feed-forward, RNN, CNN, clustering, MCMC, timeseries, tree-based, and so much more!

Stars: ✭ 90 (-1.1%)

Mutual labels: jupyter-notebook

import code from IJulia Jupyter notebooks into Julia programs

Stars: ✭ 90 (-1.1%)

Mutual labels: jupyter-notebook

Appliedml python coursera

Material and note of the course of Applied ML in Python

Stars: ✭ 90 (-1.1%)

Mutual labels: jupyter-notebook

Kafka Sparkstreaming Cassandra

Docker container for Kafka - Spark Streaming - Cassandra

Stars: ✭ 90 (-1.1%)

Mutual labels: jupyter-notebook

Trained Ternary Quantization

Reducing the size of convolutional neural networks

Stars: ✭ 90 (-1.1%)

Mutual labels: jupyter-notebook

비전공생도 한눈에 이해하는 딥러닝 자료모음

Stars: ✭ 90 (-1.1%)

Mutual labels: jupyter-notebook

Kaggle House Prices Advanced Regression Techniques

Udacity capstone project: Kaggle competition on house prices prediction using advanced regression techniques

Stars: ✭ 90 (-1.1%)

Mutual labels: jupyter-notebook

Theconsciousnessprior

AI-ON Consciousness Prior

Stars: ✭ 90 (-1.1%)

Mutual labels: jupyter-notebook

Deep Learning Tutorial Pydata

Deep learning tutorial for PyData

Stars: ✭ 90 (-1.1%)

Mutual labels: jupyter-notebook

Pymc Example Project

Example PyMC3 project for performing Bayesian data analysis using a probabilistic programming approach to machine learning.

Stars: ✭ 90 (-1.1%)

Mutual labels: jupyter-notebook

Understanding facial beauty with deep learning.

Stars: ✭ 90 (-1.1%)

Mutual labels: jupyter-notebook

Python courses for the scientific researcher

Stars: ✭ 90 (-1.1%)

Mutual labels: jupyter-notebook

Stars: ✭ 90 (-1.1%)

Mutual labels: jupyter-notebook

A Machine Learning API with native redis caching and export + import using S3. Analyze entire datasets using an API for building, training, testing, analyzing, extracting, importing, and archiving. This repository can run from a docker container or from the repository.

Stars: ✭ 90 (-1.1%)

Mutual labels: jupyter-notebook

View All Similar Projects ➔

Designing an Encoder for StyleGAN Image Manipulation

Recently, there has been a surge of diverse methods for performing image editing by employing pre-trained unconditional generators. Applying these methods on real images, however, remains a challenge, as it necessarily requires the inversion of the images into their latent space. To successfully invert a real image, one needs to find a latent code that reconstructs the input image accurately, and more importantly, allows for its meaningful manipulation. In this paper, we carefully study the latent space of StyleGAN, the state-of-the-art unconditional generator. We identify and analyze the existence of a distortion-editability tradeoff and a distortion-perception tradeoff within the StyleGAN latent space. We then suggest two principles for designing encoders in a manner that allows one to control the proximity of the inversions to regions that StyleGAN was originally trained on. We present an encoder based on our two principles that is specifically designed for facilitating editing on real images by balancing these tradeoffs. By evaluating its performance qualitatively and quantitatively on numerous challenging domains, including cars and horses, we show that our inversion method, followed by common editing techniques, achieves superior real-image editing quality, with only a small reconstruction accuracy drop.

Description

Official Implementation of "Designing an Encoder for StyleGAN Image Manipulation" paper for both training and evaluation. The e4e encoder is specifically designed to complement existing image manipulation techniques performed over StyleGAN's latent space.

Getting Started

Prerequisites

Linux or macOS
NVIDIA GPU + CUDA CuDNN (CPU may be possible with some modifications, but is not inherently supported)
Python 3

Installation

Clone the repository:

git clone https://github.com/omertov/encoder4editing.git
cd encoder4editing

Dependencies:
We recommend running this repository using Anaconda. All dependencies for defining the environment are provided in environment/e4e_env.yaml.

Inference Notebook

We provide a Jupyter notebook found in notebooks/inference_playground.ipynb that allows one to encode and perform several editings on real images using StyleGAN.

Pretrained Models

Please download the pre-trained models from the following links. Each e4e model contains the entire pSp framework architecture, including the encoder and decoder weights. | Path | Description | :--- | :---------- |FFHQ Inversion | FFHQ e4e encoder. |Cars Inversion | Cars e4e encoder. |Horse Inversion | Horse e4e encoder. |Church Inversion | Church e4e encoder.

If you wish to use one of the pretrained models for training or inference, you may do so using the flag --checkpoint_path.

In addition, we provide various auxiliary models needed for training your own e4e model from scratch. | Path | Description | :--- | :---------- |FFHQ StyleGAN | StyleGAN model pretrained on FFHQ taken from rosinality with 1024x1024 output resolution. |IR-SE50 Model | Pretrained IR-SE50 model taken from TreB1eN for use in our ID loss during training. |MOCOv2 Model | Pretrained ResNet-50 model trained using MOCOv2 for use in our simmilarity loss for domains other then human faces during training.

By default, we assume that all auxiliary models are downloaded and saved to the directory pretrained_models. However, you may use your own paths by changing the necessary values in configs/path_configs.py.

Training

To train the e4e encoder, make sure the paths to the required models, as well as training and testing data is configured in configs/path_configs.py and configs/data_configs.py.

Training the e4e Encoder

python scripts/train.py \
--dataset_type cars_encode \
--exp_dir new/experiment/directory \
--start_from_latent_avg \
--use_w_pool \
--w_discriminator_lambda 0.1 \
--progressive_start 20000 \
--id_lambda 0.5 \
--val_interval 10000 \
--max_steps 200000 \
--stylegan_size 512 \
--stylegan_weights path/to/pretrained/stylegan.pt \
--workers 8 \
--batch_size 8 \
--test_batch_size 4 \
--test_workers 4

Training on your own dataset

In order to train the e4e encoder on a custom dataset, perform the following adjustments:

Insert the paths to your train and test data into the dataset_paths variable defined in configs/paths_config.py:

dataset_paths = {
    'my_train_data': '/path/to/train/images/directory',
    'my_test_data': '/path/to/test/images/directory'
}

Configure a new dataset under the DATASETS variable defined in configs/data_configs.py:

DATASETS = {
   'my_data_encode': {
        'transforms': transforms_config.EncodeTransforms,
        'train_source_root': dataset_paths['my_train_data'],
        'train_target_root': dataset_paths['my_train_data'],
        'test_source_root': dataset_paths['my_test_data'],
        'test_target_root': dataset_paths['my_test_data']
    }
}

Refer to configs/transforms_config.py for the transformations applied to the train and test images during training.

Finally, run a training session with --dataset_type my_data_encode.

Inference

Having trained your model, you can use scripts/inference.py to apply the model on a set of images.
For example,

python scripts/inference.py \
--images_dir=/path/to/images/directory \
--save_dir=/path/to/saving/directory \
path/to/checkpoint.pt

Latent Editing Consistency (LEC)

As described in the paper, we suggest a new metric, Latent Editing Consistency (LEC), for evaluating the encoder's performance. We provide an example for calculating the metric over the FFHQ StyleGAN using the aging editing direction in metrics/LEC.py.

To run the example:

cd metrics
python LEC.py \
--images_dir=/path/to/images/directory \
path/to/checkpoint.pt

Acknowledgments

This code borrows heavily from pixel2style2pixel

Citation

If you use this code for your research, please cite our paper Designing an Encoder for StyleGAN Image Manipulation:

@article{tov2021designing,
  title={Designing an Encoder for StyleGAN Image Manipulation},
  author={Tov, Omer and Alaluf, Yuval and Nitzan, Yotam and Patashnik, Or and Cohen-Or, Daniel},
  journal={arXiv preprint arXiv:2102.02766},
  year={2021}
}

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 91

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (5) 🔗