Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → jxhe → Vae Lagging Encoder

jxhe / Vae Lagging Encoder

Licence: mit

PyTorch implementation of "Lagging Inference Networks and Posterior Collapse in Variational Autoencoders" (ICLR 2019)

Programming Languages

python

139335 projects - #7 most used programming language

Labels

vae image-generation text-generation

Projects that are alternatives of or similar to Vae Lagging Encoder

Deepnude An Image To Image Technology

DeepNude's algorithm and general image generation theory and practice research, including pix2pix, CycleGAN, UGATIT, DCGAN, SinGAN, ALAE, mGANprior, StarGAN-v2 and VAE models (TensorFlow2 implementation). DeepNude的算法以及通用生成对抗网络（GAN,Generative Adversarial Network）图像生成的理论与实践研究。

Stars: ✭ 4,029 (+2533.33%)

Mutual labels: vae, image-generation

Keras-Generating-Sentences-from-a-Continuous-Space

Text Variational Autoencoder inspired by the paper 'Generating Sentences from a Continuous Space' Bowman et al. https://arxiv.org/abs/1511.06349

Stars: ✭ 32 (-79.08%)

Mutual labels: text-generation, vae

soft-intro-vae-pytorch

[CVPR 2021 Oral] Official PyTorch implementation of Soft-IntroVAE from the paper "Soft-IntroVAE: Analyzing and Improving Introspective Variational Autoencoders"

Stars: ✭ 170 (+11.11%)

Mutual labels: vae, image-generation

Vae For Image Generation

Implemented Variational Autoencoder generative model in Keras for image generation and its latent space visualization on MNIST and CIFAR10 datasets

Stars: ✭ 87 (-43.14%)

Mutual labels: vae, image-generation

Onnxt5

Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.

Stars: ✭ 143 (-6.54%)

Mutual labels: text-generation

Oneshottranslation

Pytorch implementation of "One-Shot Unsupervised Cross Domain Translation" NIPS 2018

Stars: ✭ 135 (-11.76%)

Mutual labels: image-generation

Cyclegan

Software that can generate photos from paintings, turn horses into zebras, perform style transfer, and more.

Stars: ✭ 10,933 (+7045.75%)

Mutual labels: image-generation

Mlds2018spring

Machine Learning and having it Deep and Structured (MLDS) in 2018 spring

Stars: ✭ 124 (-18.95%)

Mutual labels: image-generation

Nlp pytorch project

Embedding, NMT, Text_Classification, Text_Generation, NER etc.

Stars: ✭ 153 (+0%)

Mutual labels: text-generation

Pytorch Vae

A Collection of Variational Autoencoders (VAE) in PyTorch.

Stars: ✭ 2,704 (+1667.32%)

Mutual labels: vae

Focal Frequency Loss

Focal Frequency Loss for Generative Models

Stars: ✭ 141 (-7.84%)

Mutual labels: image-generation

Gesturegan

[ACM MM 2018 Oral] GestureGAN for Hand Gesture-to-Gesture Translation in the Wild

Stars: ✭ 136 (-11.11%)

Mutual labels: image-generation

Guyu

pre-training and fine-tuning framework for text generation

Stars: ✭ 144 (-5.88%)

Mutual labels: text-generation

Deep Learning With Python

Example projects I completed to understand Deep Learning techniques with Tensorflow. Please note that I do no longer maintain this repository.

Stars: ✭ 134 (-12.42%)

Mutual labels: vae

Kobe

Source code and dataset for KDD 2019 paper "Towards Knowledge-Based Personalized Product Description Generation in E-commerce"

Stars: ✭ 148 (-3.27%)

Mutual labels: text-generation

Srl Zoo

State Representation Learning (SRL) zoo with PyTorch - Part of S-RL Toolbox

Stars: ✭ 125 (-18.3%)

Mutual labels: vae

Vmf vae nlp

Code for EMNLP18 paper "Spherical Latent Spaces for Stable Variational Autoencoders"

Stars: ✭ 140 (-8.5%)

Mutual labels: vae

Beat Blender

Blend beats using machine learning to create music in a fun new way.

Stars: ✭ 147 (-3.92%)

Mutual labels: vae

Tensorflow Mnist Cvae

Tensorflow implementation of conditional variational auto-encoder for MNIST

Stars: ✭ 139 (-9.15%)

Mutual labels: vae

Unetgan

Official Implementation of the paper "A U-Net Based Discriminator for Generative Adversarial Networks" (CVPR 2020)

Stars: ✭ 139 (-9.15%)

Mutual labels: image-generation

View All Similar Projects ➔

Aggressive Training of Inference Network

This is PyTorch implementation of the paper:

Lagging Inference Networks and Posterior Collapse in Variational Autoencoders
Junxian He, Daniel Spokoyny, Graham Neubig, Taylor Berg-Kirkpatrick
ICLR 2019

The code seperates optimization of encoder and decoder in VAE, and performs more steps of encoder update in each iteration. This new training procedure mitigates the issue of posterior collapse in VAE and leads to a better VAE model, without changing model components and training objective.

This repo is able to reproduce quantitative experimental results and qualitative visualizations of posterior mean space presented in the paper.

Please contact [email protected] if you have any questions.

(a) basic VAE training (b) Aggressive VAE training

Posterior Mean Space

Our approach is inspired by the definition of "posterior mean space", which helps observe the posterior status over course of training and analyze VAE training behavior from the perspective of training dynamics. In the paper we experimented with a toy dataset and a scalar latent variable, so that posterior mean space is on a 2-d plane.

Requirements

Python >= 3.6
PyTorch >= 0.4

Data

Datasets used in this paper can be downloaded with:

python prepare_data.py

By default it downloads all four datasets used in the paper, downloaded data is located in ./datasets/. A --dataset option can be provided to specify the dataset name to be downloaded:

python prepare_data.py --dataset yahoo

The argument should be synthetic, yahoo, yelp, or omniglot.

Usage

Example script to train VAE on text data (training uses GPU when available):

python text.py --dataset yahoo --aggressive 1 --warm_up 10 --kl_start 0.1

image data:

python image.py --dataset omniglot --aggressive 1 --warm_up 10 --kl_start 0.1

Logs would be printed on standard output and also saved into folder logs.

Here:

--dataset specifies the dataset name, currently it supports synthetic, yahoo, yelp for text.py and omniglot for image.py
--aggressive controls whether applies aggressive training or not
--kl_start represents starting KL weight (set to 1.0 to disable KL annealing)
--warm_up represents number of annealing epochs (KL weight increases from kl_start to 1.0 linearly in the first warm_up epochs)

To run the code on your own text/image dataset, you need to create a new configuration file in ./config/ folder to specifiy network hyperparameters and datapath. If the new config file is ./config/config_abc.py, then --dataset needs to be set as abc accordingly.

Visualization of Posterior Mean Space

We project 500 data samples onto posterior mean space and observe the change over course of training.

To reproduce this visualization figure, first train the model on the toy dataset to save statistics required for visualization (training uses GPU when available):

python toy.py --aggressive 1 --plot_mode multiple

Here --plot_mode can be specified as single to reproduce the single-point trajectory figure (Figure 3 in the paper). This command trains a VAE model with aggressive training on synthetic data, and the required statistics is saved in folder plot_data (folder would be created automatically if non-existing).

Then run the plotting script:

python plot_scripts/plot_multiple.py --aggressive 1 --iter 2000

Here --aggressive specifies the aggressive training mode (should be the same as training), --iter specifies for which iteration the figure is plotted, we save the plotting data every 200 iterations by default at training time. The generated figures would be saved in folder plot_figure as pdf files (folder would be created automatically if non-existing).

Similarly, run plot_single.py is able to generate the single-point trajectory figure.

Text Generation

Text generation is supported through sampling from either the prior or posterior (i.e. reconstruction).

Generation from prior (by default it generates 100 samples):

python text.py --dataset [dataset] --decode_from [pretrained model path]

Reconstruction:

python text.py --dataset [dataset] --decode_from [pretrained model path] --decode_input [a text file for reconstruction]

--decode_input file has one raw sentence per line, which is the same format as training data.

Optional --decoding_strategy argument can be used to specifiy decoding strategy as {greedy, beam, sample}. By default greedy decoding is performed. Generated sentences are saved in folder samples.

Mutual Information and KL Curve

To plot the KL and mutual information curves over course of training (Figure 5 in the paper), first run:

python text.py --dataset yelp --aggressive 1 --warm_up 10 --kl_start 1.0

Then please refer to plot_scripts/plot_log.ipynb as an example to read the log file and generate plots.

Reference

@inproceedings{he2018lagging,
title={Lagging Inference Networks and Posterior Collapse in Variational Autoencoders},
author={Junxian He and Daniel Spokoyny and Graham Neubig and Taylor Berg-Kirkpatrick},
booktitle={Proceedings of ICLR},
year={2019}
}

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 153

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (2) 🔗