musikisomorphie / swd

Licence: other

unsupervised video and image generation

Programming Languages

python

139335 projects - #7 most used programming language

C++

36643 projects - #6 most used programming language

Projects that are alternatives of or similar to swd

Vae For Image Generation

Implemented Variational Autoencoder generative model in Keras for image generation and its latent space visualization on MNIST and CIFAR10 datasets

Stars: ✭ 87 (+74%)

Mutual labels: generative-model, image-generation

Texturize

🤖🖌️ Generate photo-realistic textures based on source images. Remix, remake, mashup! Useful if you want to create variations on a theme or elaborate on an existing texture.

Stars: ✭ 366 (+632%)

Mutual labels: generative-model, image-generation

Generating Devanagari Using Draw

PyTorch implementation of DRAW: A Recurrent Neural Network For Image Generation trained on Devanagari dataset.

Stars: ✭ 82 (+64%)

Mutual labels: generative-model, image-generation

Lggan

[CVPR 2020] Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation

Stars: ✭ 97 (+94%)

Mutual labels: generative-model, image-generation

TriangleGAN

TriangleGAN, ACM MM 2019.

Stars: ✭ 28 (-44%)

Mutual labels: generative-model, image-generation

MoCoGAN-HD

[ICLR 2021 Spotlight] A Good Image Generator Is What You Need for High-Resolution Video Synthesis

Stars: ✭ 224 (+348%)

Mutual labels: image-generation, video-generation

Gesturegan

[ACM MM 2018 Oral] GestureGAN for Hand Gesture-to-Gesture Translation in the Wild

Stars: ✭ 136 (+172%)

Mutual labels: generative-model, image-generation

texturize

🤖🖌️ Generate photo-realistic textures based on source images. Remix, remake, mashup! Useful if you want to create variations on a theme or elaborate on an existing texture.

Stars: ✭ 495 (+890%)

Mutual labels: generative-model, image-generation

Awesome-Text-to-Image

A Survey on Text-to-Image Generation/Synthesis.

Stars: ✭ 251 (+402%)

Mutual labels: image-generation

DiffuseVAE

A combination of VAE's and Diffusion Models for efficient, controllable and high-fidelity generation from low-dimensional latents

Stars: ✭ 81 (+62%)

Mutual labels: generative-model

Semantic Pyramid for Image Generation

PyTorch reimplementation of the paper: "Semantic Pyramid for Image Generation" [CVPR 2020].

Stars: ✭ 45 (-10%)

Mutual labels: image-generation

Generalized-PixelVAE

PixelVAE with or without regularization

Stars: ✭ 64 (+28%)

Mutual labels: generative-model

BtcDet

Behind the Curtain: Learning Occluded Shapes for 3D Object Detection

Stars: ✭ 104 (+108%)

Mutual labels: generative-model

generative deep learning

Generative Deep Learning Sessions led by Anugraha Sinha (Machine Learning Tokyo)

Stars: ✭ 24 (-52%)

Mutual labels: generative-model

RG-Flow

This is project page for the paper "RG-Flow: a hierarchical and explainable flow model based on renormalization group and sparse prior". Paper link: https://arxiv.org/abs/2010.00029

Stars: ✭ 58 (+16%)

Mutual labels: generative-model

graph-nvp

GraphNVP: An Invertible Flow Model for Generating Molecular Graphs

Stars: ✭ 69 (+38%)

Mutual labels: generative-model

vae-torch

Variational autoencoder for anomaly detection (in PyTorch).

Stars: ✭ 38 (-24%)

Mutual labels: generative-model

HugsVision

HugsVision is a easy to use huggingface wrapper for state-of-the-art computer vision

Stars: ✭ 154 (+208%)

Mutual labels: image-generation

canvas-constructor

An ES6 utility for canvas with built-in functions and chained methods.

Stars: ✭ 96 (+92%)

Mutual labels: image-generation

DVAE

Official implementation of Dynamical VAEs

Stars: ✭ 75 (+50%)

Mutual labels: generative-model

View All Similar Projects ➔

Sliced Wasserstein Generative Models
-- and the application on unsupervised video generation

Papers

Prerequisites

This repo has been successfully tested on tensorflow 1.10, cuda 9.0.

Please check the requirements.txt for more details.
For the training data such as Cifar10, CelebA, CelebA-HQ, LSUN etc download them on the official website accordingly.
TrailerFaces (Note that we tentatively release the tfrecord data to avoid the copyright issue.)
- The dataset contains approximately 200,000 individual clips of various facial expressions, where the faces are cropped with 256x256 resolution from about 6,000 high resolution movie trailers on YouTube. We convert them to tfrecord with resolutions range from 4x4 to 256x256. More about the data processing please see Towards high resolution video generation (Arxiv).

TrailerFaces sample:

Standardard Training

SWAE: it requires some custom ops, which are stored under the cuda folder.

Following the instructions in install, you could compile them by yourself. If you install tensorflow by pip, one potential error can be some source files of tensorflow set the wrong relative path of cuda.h, you just need to manually change them according to your cuda path.
Alternatively, you could also use the binary files directly, which is compiled with cuda8.0.
Specify DATA_DIR, LOG_DIR and DIR in standard_training/swae_64x64.py, then run
- cd standard_training/
- python swae_64x64.py

SWGAN:

Specify DATA_DIR, LOG_DIR and DIR in standard_training/swgan_64x64.py, then run
- cd standard_training/
- python swgan_64x64.py

Progressive Training

PG-SWGAN-3D:

Specify data_dir, result_dir in progressive_training/config.py, then run
- cd progressive_training/
- python train.py
We trained our model with 1 TitanXp GPU for roughly 7 days, since our code is built upon progressive growing GAN (https://arxiv.org/pdf/1710.10196.pdf), the code can be easily adapted to multigpu training on better GPU (See progressive_training/config.py). It is expected that the training speed can be significantly improved. Here are some sample frames generated by PG-SWGAN-3D:
More video comparison, see the following youtube links:
- PG-SWGAN-3D VS PG-WGAN-3D: See video1
- VGAN VS MoCoGAN: See video2

Citation

If you use this code for your research, please cite our papers.

@inproceedings{jqwu&zwhuang2019swgm,
  title={Sliced Wasserstein Generative Models},  
  author={Jiqing Wu, Zhiwu Huang, Dinesh Acharya, Wen Li, Janine Thoma, Danda Pani Paudel, Luc Van Gool},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  year={2019}  
}

Acknowledgments

This code borrows from WGAN-GP (https://github.com/igul222/improved_wgan_training) and PGGAN (https://github.com/tkarras/progressive_growing_of_gans). We would like to thank them for the contribution.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

musikisomorphie / swd

Programming Languages

Labels

Projects that are alternatives of or similar to swd

Sliced Wasserstein Generative Models
-- and the application on unsupervised video generation

Papers

Prerequisites

Standardard Training

SWAE: it requires some custom ops, which are stored under the cuda folder.

SWGAN:

Progressive Training

PG-SWGAN-3D:

Citation

Acknowledgments

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

musikisomorphie / swd

Programming Languages

Labels

Projects that are alternatives of or similar to swd

Sliced Wasserstein Generative Models-- and the application on unsupervised video generation

Papers

Prerequisites

Standardard Training

SWAE: it requires some custom ops, which are stored under the cuda folder.

SWGAN:

Progressive Training

PG-SWGAN-3D:

Citation

Acknowledgments

Sliced Wasserstein Generative Models
-- and the application on unsupervised video generation