zsdonghao / im2txt2im

Licence: other
I2T2I: Text-to-Image Synthesis with textual data augmentation

Image Captioning and Text-to-Image Synthesis with textual data augmentation

This code runs well under Python 2.7 and TensorFlow 0.11. If you use a newer version of TensorFlow, you may need to update the tensorlayer folder from the TensorLayer library.

Usage

1. Prepare MSCOCO data and Inception model

  • Before you run the scripts, follow Google's setup guide and set up the model, ckpt, and data directories in *.py.
  • Create a data folder.
  • Download and preprocess the MSCOCO data: click here.
  • Download the Inception_V3 checkpoint: click here.

2. Train image captioning model

  • Train your image captioning model on MSCOCO by following my other repo.

3. Set up your paths

  • In train_im2txt2im_coco_64.py:
  • Set your image directory: images_train_dir = '/home/.../mscoco/raw-data/train2014/'
  • Set the location of the vocabulary and model of your image captioning module: DIR = "/home/..."
  • Directory containing model checkpoints: CHECKPOINT_DIR = DIR + "/model/train"
  • Vocabulary file generated by the preprocessing script: VOCAB_FILE = DIR + "/data/mscoco/word_counts.txt"
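
Before starting a long training run, it can help to confirm that these paths exist. The following is a minimal sketch, not part of the repository: the variable names and values are copied from the list above (the elided "/home/..." is left as-is and must be replaced with your own location), while `missing_paths` and `read_word_counts` are hypothetical helpers. The parser assumes the vocabulary file holds one "word count" pair per line, which you should check against your own word_counts.txt.

```python
import os

# Values copied from the README. "/home/..." is elided in the original and
# must be replaced with your own location before this is useful.
DIR = "/home/..."
images_train_dir = "/home/.../mscoco/raw-data/train2014/"
CHECKPOINT_DIR = DIR + "/model/train"
VOCAB_FILE = DIR + "/data/mscoco/word_counts.txt"


def missing_paths(paths):
    """Return the entries of `paths` that do not exist on disk.

    Hypothetical helper, not part of the repository.
    """
    return [p for p in paths if not os.path.exists(p)]


def read_word_counts(path):
    """Parse a vocabulary file into (word, count) tuples.

    ASSUMPTION: each line looks like "<word> <count>". Lines that do not
    split into exactly two fields are skipped.
    """
    vocab = []
    with open(path) as f:
        for line in f:
            parts = line.split()
            if len(parts) == 2:
                vocab.append((parts[0], int(parts[1])))
    return vocab


if __name__ == "__main__":
    # Report every configured path that is not present yet.
    for p in missing_paths([images_train_dir, CHECKPOINT_DIR, VOCAB_FILE]):
        print("missing: " + p)
```

Running the file directly prints one line per path that still needs to be created or corrected. The sketch avoids f-strings and other Python 3 only syntax so that it also runs under Python 2.7.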

4. Train text-to-image synthesis with image captioning

  • model_im2txt.py: model for image captioning
  • train_im2txt2im_coco_64.py: script for training I2T2I
  • utils.py: utility functions

Results

1. Here are some results on MSCOCO

2. Transfer learning on MHP dataset

Citation

  • If you find this work useful, please cite:
@article{hao2017im2txt2im,
  title={I2T2I: LEARNING TEXT TO IMAGE SYNTHESIS WITH TEXTUAL DATA AUGMENTATION},
  author={Hao Dong and Jingqing Zhang and Douglas McIlwraith and Yike Guo},
  journal={ICIP},
  year={2017}
}