
tohinz / ConSinGAN

License: MIT
PyTorch implementation of "Improved Techniques for Training Single-Image GANs" (WACV-21)

Programming Languages

python

Projects that are alternatives to or similar to ConSinGAN

Arbitrary Text To Image Papers
A collection of arbitrary text to image papers with code (constantly updating)
Stars: ✭ 196 (-33.33%)
Mutual labels:  gan, image-generation, image
Semantic Pyramid for Image Generation
PyTorch reimplementation of the paper: "Semantic Pyramid for Image Generation" [CVPR 2020].
Stars: ✭ 45 (-84.69%)
Mutual labels:  gan, image-generation
Pytorch Cyclegan And Pix2pix
Image-to-Image Translation in PyTorch
Stars: ✭ 16,477 (+5504.42%)
Mutual labels:  gan, image-generation
Deep-Exemplar-based-Video-Colorization
The source code of CVPR 2019 paper "Deep Exemplar-based Video Colorization".
Stars: ✭ 180 (-38.78%)
Mutual labels:  gan, image-generation
Storygan
StoryGAN: A Sequential Conditional GAN for Story Visualization
Stars: ✭ 184 (-37.41%)
Mutual labels:  gan, image-generation
Swapnet
Virtual Clothing Try-on with Deep Learning. PyTorch reproduction of SwapNet by Raj et al. 2018. Now with Docker support!
Stars: ✭ 202 (-31.29%)
Mutual labels:  gan, image-generation
automatic-manga-colorization
Use keras.js and cyclegan-keras to colorize manga automatically. All computation runs in the browser; a demo is available online.
Stars: ✭ 20 (-93.2%)
Mutual labels:  gan, image-generation
Focal Frequency Loss
Focal Frequency Loss for Generative Models
Stars: ✭ 141 (-52.04%)
Mutual labels:  gan, image-generation
MNIST-invert-color
Invert the color of MNIST images with PyTorch
Stars: ✭ 13 (-95.58%)
Mutual labels:  gan, image-generation
lecam-gan
Regularizing Generative Adversarial Networks under Limited Data (CVPR 2021)
Stars: ✭ 127 (-56.8%)
Mutual labels:  gan, image-generation
AsymmetricGAN
[ACCV 2018 Oral] Dual Generator Generative Adversarial Networks for Multi-Domain Image-to-Image Translation
Stars: ✭ 42 (-85.71%)
Mutual labels:  gan, image-generation
Distancegan
Pytorch implementation of "One-Sided Unsupervised Domain Mapping" NIPS 2017
Stars: ✭ 180 (-38.78%)
Mutual labels:  gan, image-generation
Inpainting gmcnn
Image Inpainting via Generative Multi-column Convolutional Neural Networks, NeurIPS2018
Stars: ✭ 256 (-12.93%)
Mutual labels:  gan, image-generation
Paddlegan
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, wav2lip, picture repair, image editing, photo2cartoon, image style transfer, and so on.
Stars: ✭ 4,987 (+1596.26%)
Mutual labels:  gan, image-generation
Tsit
[ECCV 2020 Spotlight] A Simple and Versatile Framework for Image-to-Image Translation
Stars: ✭ 141 (-52.04%)
Mutual labels:  gan, image-generation
mSRGAN-A-GAN-for-single-image-super-resolution-on-high-content-screening-microscopy-images.
Generative Adversarial Network for single image super-resolution in high content screening microscopy images
Stars: ✭ 52 (-82.31%)
Mutual labels:  gan, image-generation
Oneshottranslation
Pytorch implementation of "One-Shot Unsupervised Cross Domain Translation" NIPS 2018
Stars: ✭ 135 (-54.08%)
Mutual labels:  gan, image-generation
Unetgan
Official Implementation of the paper "A U-Net Based Discriminator for Generative Adversarial Networks" (CVPR 2020)
Stars: ✭ 139 (-52.72%)
Mutual labels:  gan, image-generation
ADL2019
Applied Deep Learning (2019 Spring) @ NTU
Stars: ✭ 20 (-93.2%)
Mutual labels:  gan, image-generation
Awesome-ICCV2021-Low-Level-Vision
A Collection of Papers and Codes for ICCV2021 Low Level Vision and Image Generation
Stars: ✭ 163 (-44.56%)
Mutual labels:  gan, image-generation

ConSinGAN

Official implementation of the paper "Improved Techniques for Training Single-Image GANs" by Tobias Hinz, Matthew Fisher, Oliver Wang, and Stefan Wermter.

For a short summary of our paper see our blog post.

Additional data: Video and Supplementary Material.

We examine and recommend new techniques for training GANs on a single image. Our model is trained iteratively on several different resolutions of the original training image, where the resolution increases as training proceeds. Whenever we increase the resolution of the training image we also increase the capacity of the generator by adding additional convolutional layers. At any given time we do not train the full model, but only parts of it, i.e., the most recently added convolutional layers. The newest convolutional layers are trained with the given learning rate, while previously added convolutional layers are trained with a smaller learning rate.
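The following is a minimal sketch of this training schedule, not the repository's actual code: it assumes a list of per-stage convolutional blocks (oldest first), trains only the most recent few of them concurrently, and multiplies the learning rate by the scaling factor for each lower stage. Function names, the number of concurrently trained stages, and the default values are illustrative.

import torch

def build_stage_optimizer(stages, base_lr=5e-4, lr_scale=0.1, concurrent=3):
    # stages: list of nn.Module conv blocks, oldest first; only the newest
    # `concurrent` blocks receive gradients, all older blocks are frozen.
    trainable = stages[-concurrent:]
    param_groups = []
    for depth, block in enumerate(reversed(trainable)):
        # depth 0 = most recently added block -> full learning rate;
        # each lower stage gets its learning rate multiplied by lr_scale.
        param_groups.append({"params": block.parameters(),
                             "lr": base_lr * (lr_scale ** depth)})
    for block in stages[:-concurrent]:
        for p in block.parameters():
            p.requires_grad_(False)
    return torch.optim.Adam(param_groups, betas=(0.5, 0.999))

With a scaling factor of 0.1 (the default, see below), a stage two levels below the newest one would in this sketch train at one hundredth of the base learning rate.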

Model-Architecture

Installation

  • Python 3.5
  • PyTorch 1.1.0

Install the remaining dependencies with:

pip install -r requirements.txt

Unconditional Generation

To train a model with the default parameters from our paper run:

python main_train.py --gpu 0 --train_mode generation --input_name Images/Generation/angkorwat.jpg

Training one model should take about 20-25 minutes on an NVIDIA GeForce GTX 1080Ti.

Modify Learning Rate Scaling and Number of Trained Stages

To affect sample diversity and image quality we recommend experimenting with the learning rate scaling (default is 0.1) and the number of trained stages (default is 6). This is especially helpful if the images are more complex (use a higher learning rate scaling) or if you want to train on images with higher resolution (use more stages). Increasing the learning rate scaling means that lower stages are trained with a higher learning rate and can therefore learn a more faithful model of the original image. For example, observe the difference in generated images of the Colosseum when the model is trained with a learning rate scale of 0.1 or 0.5:

Learning Rate Scaling Visualization

To modify the learning rate scaling run:

python main_train.py --gpu 0 --train_mode generation --input_name Images/Generation/colusseum.png --lr_scale 0.5

Training with more stages can help with images that exhibit a large global structure that should stay the same, see e.g.:

Trained Stages Visualization

To modify the number of trained stages run:

python main_train.py --gpu 0 --train_mode generation --input_name Images/Generation/colusseum.png --train_stages 7

Results

The output is saved to TrainedModels/ and we log the training process with TensorBoard. The top left image in the visualized image grids is the original training image; all other images are generated. To monitor the progress, go to the respective folder and run

 tensorboard --logdir .

Sample More Images

To sample more images from a trained model run:

python evaluate_model.py --gpu 0 --model_dir TrainedModels/colusseum/.../ --num_samples 50

This will use the model to generate num_samples images at the default as well as scaled resolutions. The results are saved in a folder Evaluation inside the model_dir.

Unconditional Generation (Arbitrary Sizes)

The default unconditional image generation is geared to also induce diversity at the edges of generated images. When generating images of arbitrary sizes (especially larger ones) this often breaks the image layout. Therefore, we also provide an option that changes the upsampling and noise addition slightly to improve results when a model should generate images of arbitrary sizes. The training, model architecture, loss function, etc. stay the same; the only changes are how the random noise is added and a slightly different upsampling routine between the different generator stages. To train a model more suited for image generation of arbitrary sizes run:

python main_train.py --gpu 0 --train_mode retarget --input_name Images/Generation/colusseum.png
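Conceptually, generating at an arbitrary size only requires rescaling the spatial dimensions of the input noise and of the intermediate feature maps; the convolutional weights stay unchanged. The sketch below illustrates this idea under the assumption that each stage is a small convolutional block that preserves the channel count; the per-stage growth factor and noise amplitude are illustrative values, not the ones used in this repository.

import torch
import torch.nn.functional as F

def generate_arbitrary_size(stages, base_size=(32, 32), scale_h=1.0, scale_w=2.0,
                            channels=3, noise_amp=0.1):
    # Rescale the base noise map to the desired aspect ratio / size.
    h, w = int(base_size[0] * scale_h), int(base_size[1] * scale_w)
    x = torch.randn(1, channels, h, w)
    for i, stage in enumerate(stages):
        x = stage(x)                                   # each stage: a small conv block
        if i < len(stages) - 1:
            h, w = int(h * 4 / 3), int(w * 4 / 3)      # illustrative per-stage growth factor
            x = F.interpolate(x, size=(h, w), mode='bilinear', align_corners=False)
            x = x + noise_amp * torch.randn_like(x)    # noise added between stages
    return x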

Retargeting Visualization

Image Animation

To train a default animation model (hyperparameters are the same as for image generation):

python main_train.py --gpu 0 --train_mode animation --input_name Images/Animation/lightning1.png

To generate GIFs from the trained model:

python evaluate_model.py --gpu 0 --model_dir TrainedModels/lightning1/...
Example animations (strong, medium, and weak motion) of the lightning and bush images.
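The evaluation script handles GIF creation. Purely as an illustration of how animation frames can be produced from a single-image generator (e.g. via a smoothed random walk in the input noise, as popularized by SinGAN, on which this work builds), a hypothetical sketch could look like the following. The generator and base_noise objects, as well as the beta and sigma parameters controlling how strong the motion is, are assumptions for illustration only.

import torch

def animate(generator, base_noise, num_frames=100, beta=0.95, sigma=0.1):
    frames = []
    z = base_noise.clone()
    velocity = torch.zeros_like(base_noise)
    for _ in range(num_frames):
        # Momentum-style update keeps the walk smooth; sigma controls how far
        # each frame deviates (larger sigma -> stronger animation).
        velocity = beta * velocity + (1 - beta) * sigma * torch.randn_like(z)
        z = z + velocity
        with torch.no_grad():
            frames.append(generator(z))
    return frames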

Harmonization

To train a default harmonization model that does not use anything besides the training image:

python main_train.py --gpu 0 --train_mode harmonization --train_stages 3 --min_size 120 --lrelu_alpha 0.3 --niter 1000 --batch_norm --input_name Images/Harmonization/scream.jpg

Training should take about 5-10 minutes for three stages. Reducing --min_size speeds up training, while increasing it may lead to better results.

To harmonize a given image with a pre-trained model:

python evaluate_model.py --gpu 0 --model_dir TrainedModels/scream/.../ --naive_img Images/Harmonization/scream_naive.jpg

If you already have a naive image that you want to use to monitor the training progress (the naive image is only used at test time, not at train time):

python main_train.py --gpu 0 --train_mode harmonization --train_stages 3 --min_size 120 --lrelu_alpha 0.3 --niter 1000 --batch_norm --input_name Images/Harmonization/pencil_tree.jpg --naive_img Images/Harmonization/pencil_tree_naive.jpg

To fine-tune a pre-trained model on a given image (the naive image is also used at train time):

python main_train.py --gpu 0 --train_mode harmonization --input_name Images/Harmonization/pencil_tree.jpg --naive_img Images/Harmonization/pencil_tree_naive.jpg --fine_tune --model_dir TrainedModels/pencil_tree/...

Training a model for fine-tuning should take 1-5 minutes, depending on the model and image size.

Harmonization Visualization

Editing

Training for the editing task is the same as for the harmonization task, except that we train on more stages and use a slightly different image augmentation technique: at each iteration we swap random patches within the training image.

python main_train.py --gpu 0 --train_mode editing --batch_norm --niter 1000 --input_name Images/Editing/stone.png
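A minimal sketch of the patch-swap augmentation described above, assuming the training image is a (C, H, W) tensor; the patch size and number of swaps are illustrative choices, not the values used in the repository:

import torch

def swap_random_patches(img, patch_size=32, num_swaps=4):
    # Swap a few randomly chosen square patches within the training image.
    img = img.clone()
    _, h, w = img.shape
    for _ in range(num_swaps):
        y1 = torch.randint(0, h - patch_size, (1,)).item()
        x1 = torch.randint(0, w - patch_size, (1,)).item()
        y2 = torch.randint(0, h - patch_size, (1,)).item()
        x2 = torch.randint(0, w - patch_size, (1,)).item()
        p1 = img[:, y1:y1 + patch_size, x1:x1 + patch_size].clone()
        p2 = img[:, y2:y2 + patch_size, x2:x2 + patch_size].clone()
        img[:, y1:y1 + patch_size, x1:x1 + patch_size] = p2
        img[:, y2:y2 + patch_size, x2:x2 + patch_size] = p1
    return img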

or, if a naive image should be used for monitoring training progress (but not for training itself):

python main_train.py --gpu 0 --train_mode editing --batch_norm --niter 1000 --input_name Images/Editing/stone.png --naive_img Images/Editing/stone_edit_1.png

To fine-tune a model:

python main_train.py --gpu 0 --input_name Images/Editing/stone.png --naive_img Images/Editing/stone_edit_1.png --fine_tune --model_dir TrainedModels/stone/...

To evaluate:

python evaluate_model.py --gpu 0 --model_dir TrainedModels/stone/.../ --naive_img Images/Editing/stone_edit_1.png

Editing Visualization

Additional Data

The folder User-Studies contains the raw images we used to conduct our user study.

Acknowledgements

Our implementation is based on this implementation of the SinGAN paper.

Citation

If you find this code useful, please consider citing:

@inproceedings{hinz2021improved,
    author    = {Hinz, Tobias and Fisher, Matthew and Wang, Oliver and Wermter, Stefan},
    title     = {Improved Techniques for Training Single-Image GANs},
    booktitle = {Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)},
    month     = {January},
    year      = {2021},
    pages     = {1300--1309}
}