All Projects → prannayk → videoMultiGAN

prannayk / videoMultiGAN

Licence: other
End to End learning for Video Generation from Text

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to videoMultiGAN

pix2pix-tensorflow
A minimal tensorflow implementation of pix2pix (Image-to-Image Translation with Conditional Adversarial Nets - https://phillipi.github.io/pix2pix/).
Stars: ✭ 22 (-58.49%)
Mutual labels:  dnn, gan
Pytorch Forecasting
Time series forecasting with PyTorch
Stars: ✭ 849 (+1501.89%)
Mutual labels:  learning, deep
Awesome Cybersecurity Datasets
A curated list of amazingly awesome Cybersecurity datasets
Stars: ✭ 380 (+616.98%)
Mutual labels:  learning, deep
Keraspp
코딩셰프의 3분 딥러닝, 케라스맛
Stars: ✭ 178 (+235.85%)
Mutual labels:  dnn, gan
Neural Api
CAI NEURAL API - Pascal based neural network API optimized for AVX, AVX2 and AVX512 instruction sets plus OpenCL capable devices including AMD, Intel and NVIDIA.
Stars: ✭ 94 (+77.36%)
Mutual labels:  learning, deep
Jeelizar
JavaScript object detection lightweight library for augmented reality (WebXR demos included). It uses convolutional neural networks running on the GPU with WebGL.
Stars: ✭ 296 (+458.49%)
Mutual labels:  learning, deep
Variational Autoencoder
Variational autoencoder implemented in tensorflow and pytorch (including inverse autoregressive flow)
Stars: ✭ 807 (+1422.64%)
Mutual labels:  learning, deep
Credit-Card-Fraud
No description or website provided.
Stars: ✭ 17 (-67.92%)
Mutual labels:  deep, dnn
Deeplearning Mindmap
A mindmap summarising Deep Learning concepts.
Stars: ✭ 1,251 (+2260.38%)
Mutual labels:  learning, deep
Ludwig
Data-centric declarative deep learning framework
Stars: ✭ 8,018 (+15028.3%)
Mutual labels:  learning, deep
Deepj
A deep learning model for style-specific music generation.
Stars: ✭ 681 (+1184.91%)
Mutual labels:  learning, deep
Ros robotics projects
Example codes of new book ROS Robotics Projects
Stars: ✭ 240 (+352.83%)
Mutual labels:  learning, deep
Artificialintelligenceengines
Computer code collated for use with Artificial Intelligence Engines book by JV Stone
Stars: ✭ 35 (-33.96%)
Mutual labels:  learning, gan
Djl Demo
Demo applications showcasing DJL
Stars: ✭ 126 (+137.74%)
Mutual labels:  learning, deep
Learningx
Deep & Classical Reinforcement Learning + Machine Learning Examples in Python
Stars: ✭ 241 (+354.72%)
Mutual labels:  learning, deep
IrwGAN
Official pytorch implementation of the IrwGAN for unaligned image-to-image translation
Stars: ✭ 33 (-37.74%)
Mutual labels:  gan
Guilds
Guilds de OSW
Stars: ✭ 38 (-28.3%)
Mutual labels:  learning
sourceseparation misc
No description or website provided.
Stars: ✭ 44 (-16.98%)
Mutual labels:  gan
SeqGAN-PyTorch
Implementation of Sequence Generative Adversarial Nets with Policy Gradient in PyTorch
Stars: ✭ 40 (-24.53%)
Mutual labels:  gan
GAN-RNN Timeseries-imputation
Recurrent GAN for imputation of time series data. Implemented in TensorFlow 2 on Wikipedia Web Traffic Forecast dataset from Kaggle.
Stars: ✭ 107 (+101.89%)
Mutual labels:  gan

Video Multi GAN

Video Generation from Text using Tree like decision using GANs. The text annotation or statement is encoded using the LM into a embedding, which then is combined with random vector to generate relevant videos and images.

Video Generation models

  1. VAEGAN
  2. VAEGAN with Latent Variable optimization
  3. VAEGAN with anti reconstruction loss
  4. VAEGAN + Anti reconstruction loss + Latent variable models
  5. variants of above models with different Hyper parameters

Model structure

  • LSTM based model for next frame creation
  • Wasserstein GAN setting discriminator
  • Word embedding based LM
  • Attention based model for classification structure

Training model

  • The relevant models are in Tensorflow >= v1.2
  • Experimentation with above mentioned models
  • The training is done over self generated Bouncing MNIST with sentence based annotation
  • The gensim pre trained fastText wikipedia work embeddings are used for embedding tokens as vectors
  • Non attention based models are used initially to generate starting frames.
  • The GAN tree trains to look for discriminative features (unverified)

Datasets

  1. UCF101 : 3 channel image
  2. Bouncing MNIST

Documentation

  1. We use Sync-DRAW to develop our datasets (https://github.com/syncdraw/Sync-DRAW)
  2. UCF101 is available from University of Montreal
  3. We use multiple GPU training (or a single K80 or Titan X)
  4. Cluster traning is impossible for now

Results will not be updated here since there might be related publications.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].