musikisomorphie / swd

Licence: other
unsupervised video and image generation

Programming Languages

python
139335 projects - #7 most used programming language
C++
36643 projects - #6 most used programming language

Projects that are alternatives of or similar to swd

Vae For Image Generation
Implemented Variational Autoencoder generative model in Keras for image generation and its latent space visualization on MNIST and CIFAR10 datasets
Stars: ✭ 87 (+74%)
Mutual labels:  generative-model, image-generation
Texturize
🤖🖌️ Generate photo-realistic textures based on source images. Remix, remake, mashup! Useful if you want to create variations on a theme or elaborate on an existing texture.
Stars: ✭ 366 (+632%)
Mutual labels:  generative-model, image-generation
Generating Devanagari Using Draw
PyTorch implementation of DRAW: A Recurrent Neural Network For Image Generation trained on Devanagari dataset.
Stars: ✭ 82 (+64%)
Mutual labels:  generative-model, image-generation
Lggan
[CVPR 2020] Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation
Stars: ✭ 97 (+94%)
Mutual labels:  generative-model, image-generation
TriangleGAN
TriangleGAN, ACM MM 2019.
Stars: ✭ 28 (-44%)
Mutual labels:  generative-model, image-generation
MoCoGAN-HD
[ICLR 2021 Spotlight] A Good Image Generator Is What You Need for High-Resolution Video Synthesis
Stars: ✭ 224 (+348%)
Mutual labels:  image-generation, video-generation
Gesturegan
[ACM MM 2018 Oral] GestureGAN for Hand Gesture-to-Gesture Translation in the Wild
Stars: ✭ 136 (+172%)
Mutual labels:  generative-model, image-generation
texturize
🤖🖌️ Generate photo-realistic textures based on source images. Remix, remake, mashup! Useful if you want to create variations on a theme or elaborate on an existing texture.
Stars: ✭ 495 (+890%)
Mutual labels:  generative-model, image-generation
Awesome-Text-to-Image
A Survey on Text-to-Image Generation/Synthesis.
Stars: ✭ 251 (+402%)
Mutual labels:  image-generation
DiffuseVAE
A combination of VAE's and Diffusion Models for efficient, controllable and high-fidelity generation from low-dimensional latents
Stars: ✭ 81 (+62%)
Mutual labels:  generative-model
Semantic Pyramid for Image Generation
PyTorch reimplementation of the paper: "Semantic Pyramid for Image Generation" [CVPR 2020].
Stars: ✭ 45 (-10%)
Mutual labels:  image-generation
Generalized-PixelVAE
PixelVAE with or without regularization
Stars: ✭ 64 (+28%)
Mutual labels:  generative-model
BtcDet
Behind the Curtain: Learning Occluded Shapes for 3D Object Detection
Stars: ✭ 104 (+108%)
Mutual labels:  generative-model
generative deep learning
Generative Deep Learning Sessions led by Anugraha Sinha (Machine Learning Tokyo)
Stars: ✭ 24 (-52%)
Mutual labels:  generative-model
RG-Flow
This is project page for the paper "RG-Flow: a hierarchical and explainable flow model based on renormalization group and sparse prior". Paper link: https://arxiv.org/abs/2010.00029
Stars: ✭ 58 (+16%)
Mutual labels:  generative-model
graph-nvp
GraphNVP: An Invertible Flow Model for Generating Molecular Graphs
Stars: ✭ 69 (+38%)
Mutual labels:  generative-model
vae-torch
Variational autoencoder for anomaly detection (in PyTorch).
Stars: ✭ 38 (-24%)
Mutual labels:  generative-model
HugsVision
HugsVision is a easy to use huggingface wrapper for state-of-the-art computer vision
Stars: ✭ 154 (+208%)
Mutual labels:  image-generation
canvas-constructor
An ES6 utility for canvas with built-in functions and chained methods.
Stars: ✭ 96 (+92%)
Mutual labels:  image-generation
DVAE
Official implementation of Dynamical VAEs
Stars: ✭ 75 (+50%)
Mutual labels:  generative-model

Sliced Wasserstein Generative Models
-- and the application on unsupervised video generation

Papers

Prerequisites

This repo has been successfully tested on tensorflow 1.10, cuda 9.0.

  • Please check the requirements.txt for more details.

  • For the training data such as Cifar10, CelebA, CelebA-HQ, LSUN etc download them on the official website accordingly.

  • TrailerFaces (Note that we tentatively release the tfrecord data to avoid the copyright issue.)

    • The dataset contains approximately 200,000 individual clips of various facial expressions, where the faces are cropped with 256x256 resolution from about 6,000 high resolution movie trailers on YouTube. We convert them to tfrecord with resolutions range from 4x4 to 256x256. More about the data processing please see Towards high resolution video generation (Arxiv).

TrailerFaces sample: Trailerfaces sample

Standardard Training

SWAE: it requires some custom ops, which are stored under the cuda folder.

  • Following the instructions in install, you could compile them by yourself. If you install tensorflow by pip, one potential error can be some source files of tensorflow set the wrong relative path of cuda.h, you just need to manually change them according to your cuda path.
  • Alternatively, you could also use the binary files directly, which is compiled with cuda8.0.
  • Specify DATA_DIR, LOG_DIR and DIR in standard_training/swae_64x64.py, then run
    • cd standard_training/
    • python swae_64x64.py

SWGAN:

  • Specify DATA_DIR, LOG_DIR and DIR in standard_training/swgan_64x64.py, then run
    • cd standard_training/
    • python swgan_64x64.py

Progressive Training

PG-SWGAN-3D:

  • Specify data_dir, result_dir in progressive_training/config.py, then run

    • cd progressive_training/
    • python train.py
  • We trained our model with 1 TitanXp GPU for roughly 7 days, since our code is built upon progressive growing GAN (https://arxiv.org/pdf/1710.10196.pdf), the code can be easily adapted to multigpu training on better GPU (See progressive_training/config.py). It is expected that the training speed can be significantly improved. Here are some sample frames generated by PG-SWGAN-3D: PG-SWGAN-3D

  • More video comparison, see the following youtube links:

    • PG-SWGAN-3D VS PG-WGAN-3D: See video1
    • VGAN VS MoCoGAN: See video2

Citation

  • If you use this code for your research, please cite our papers.
@inproceedings{jqwu&zwhuang2019swgm,
  title={Sliced Wasserstein Generative Models},  
  author={Jiqing Wu, Zhiwu Huang, Dinesh Acharya, Wen Li, Janine Thoma, Danda Pani Paudel, Luc Van Gool},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  year={2019}  
}

Acknowledgments

This code borrows from WGAN-GP (https://github.com/igul222/improved_wgan_training) and PGGAN (https://github.com/tkarras/progressive_growing_of_gans). We would like to thank them for the contribution.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].