All Projects → yonkshi → text2imageNet

yonkshi / text2imageNet

Licence: other
Generate image from text with Generative Adversarial Network

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to text2imageNet

market risk gan tensorflow
Using Bidirectional Generative Adversarial Networks to estimate Value-at-Risk for Market Risk Management using TensorFlow.
Stars: ✭ 63 (+142.31%)
Mutual labels:  generative-adversarial-network
HashGAN
HashGAN: Deep Learning to Hash with Pair Conditional Wasserstein GAN
Stars: ✭ 63 (+142.31%)
Mutual labels:  generative-adversarial-network
binarygan
Code for "Training Generative Adversarial Networks with Binary Neurons by End-to-end Backpropagation"
Stars: ✭ 25 (-3.85%)
Mutual labels:  generative-adversarial-network
publications-arruda-ijcnn-2019
Cross-Domain Car Detection Using Unsupervised Image-to-Image Translation: From Day to Night
Stars: ✭ 59 (+126.92%)
Mutual labels:  generative-adversarial-network
Generative Continual Learning
No description or website provided.
Stars: ✭ 51 (+96.15%)
Mutual labels:  generative-adversarial-network
Paper-Notes
Paper notes in deep learning/machine learning and computer vision
Stars: ✭ 37 (+42.31%)
Mutual labels:  generative-adversarial-network
CycleGAN-gluon-mxnet
this repo attemps to reproduce Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks(CycleGAN) use gluon reimplementation
Stars: ✭ 31 (+19.23%)
Mutual labels:  generative-adversarial-network
DCGAN-CIFAR10
A implementation of DCGAN (Deep Convolutional Generative Adversarial Networks) for CIFAR10 image
Stars: ✭ 18 (-30.77%)
Mutual labels:  generative-adversarial-network
FAST-RIR
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.
Stars: ✭ 90 (+246.15%)
Mutual labels:  generative-adversarial-network
Pytorch-conditional-GANs
Implementation of Conditional Generative Adversarial Networks in PyTorch
Stars: ✭ 91 (+250%)
Mutual labels:  generative-adversarial-network
MultiGraphGAN
MultiGraphGAN for predicting multiple target graphs from a source graph using geometric deep learning.
Stars: ✭ 16 (-38.46%)
Mutual labels:  generative-adversarial-network
Anime2Sketch
A sketch extractor for anime/illustration.
Stars: ✭ 1,623 (+6142.31%)
Mutual labels:  generative-adversarial-network
gans-2.0
Generative Adversarial Networks in TensorFlow 2.0
Stars: ✭ 76 (+192.31%)
Mutual labels:  generative-adversarial-network
pytorch-gans
PyTorch implementation of GANs (Generative Adversarial Networks). DCGAN, Pix2Pix, CycleGAN, SRGAN
Stars: ✭ 21 (-19.23%)
Mutual labels:  generative-adversarial-network
Self-Supervised-GANs
Tensorflow Implementation for paper "self-supervised generative adversarial networks"
Stars: ✭ 34 (+30.77%)
Mutual labels:  generative-adversarial-network
deep-blueberry
If you've always wanted to learn about deep-learning but don't know where to start, then you might have stumbled upon the right place!
Stars: ✭ 17 (-34.62%)
Mutual labels:  generative-adversarial-network
speech-enhancement-WGAN
speech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN
Stars: ✭ 35 (+34.62%)
Mutual labels:  generative-adversarial-network
progressive-growing-of-gans.pytorch
Unofficial PyTorch implementation of "Progressive Growing of GANs for Improved Quality, Stability, and Variation".
Stars: ✭ 51 (+96.15%)
Mutual labels:  generative-adversarial-network
SMILE
SMILE: Semantically-guided Multi-attribute Image and Layout Editing, ICCV Workshops 2021.
Stars: ✭ 28 (+7.69%)
Mutual labels:  generative-adversarial-network
GAN-LTH
[ICLR 2021] "GANs Can Play Lottery Too" by Xuxi Chen, Zhenyu Zhang, Yongduo Sui, Tianlong Chen
Stars: ✭ 24 (-7.69%)
Mutual labels:  generative-adversarial-network

Text2Image

Text2Image can understand a human written description of an object to generate a realistic image based on that description. Text2Image is using a type of generative adversarial network (GAN-CLS), implemented from scratch using Tensorflow.

text2image

You can also see a progression of GAN training

GAN_progression

Our project is primarily based on the works of Generative Adversarial Text to Image Synthesis [Reed et al, 2016], the training is done in three steps:

  1. Encoding a pretrained GoogLeNet into high dimensional embedding space
  2. Training a text encoder to encode text into same high dimensional embedding space as step1
  3. Generative adversarial training on both generator network and discriminator network, feeding in encoded text using encoder from step 2.

Training

  • Step one: pre-encode.py encode images into 1024 dimensional embedding space with GoogLeNet [Reference here]
  • Step two: train_text_encoder.py to train text encoder into 1024 dim embedding space using encoded images in step 1
  • Step three: trainer_gan.py to train GAN with the already trained text encoder from step 2. In conf.py you can multi GPU support

Training details

The text encoder was trained on a single Nvidia K80 GPU for 12 hours, The GAN was trained on GCP with 4 Nvidia K80 GPUs for about 8 hours.

This is what the training loss look like:

text encoder (step 2) loss: encoder loss

Generator (step 3) loss:

g loss

Discrimator (also step 3) loss:

d loss

Dataset

Pretrained models

Please PM me

Known issues:

  • The text encoder embedding space might not have been normalized properly, because GAN diverges on certain text descriptions. however a randomly initialized text encoder can almost always ensure stable GAN learning.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].