
tsenghungchen / Sa Tensorflow

Soft attention mechanism for video caption generation

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Sa Tensorflow

SAE-NAD
The implementation of "Point-of-Interest Recommendation: Exploiting Self-Attentive Autoencoders with Neighbor-Aware Influence"
Stars: ✭ 48 (-68.83%)
Mutual labels:  attention-model
Reading comprehension tf
Machine Reading Comprehension in Tensorflow
Stars: ✭ 37 (-75.97%)
Mutual labels:  attention-model
Attention Gated Networks
Use of Attention Gates in a Convolutional Neural Network / Medical Image Classification and Segmentation
Stars: ✭ 1,237 (+703.25%)
Mutual labels:  attention-model
Attentiongan
AttentionGAN for Unpaired Image-to-Image Translation & Multi-Domain Image-to-Image Translation
Stars: ✭ 341 (+121.43%)
Mutual labels:  attention-model
Nmt Keras
Neural Machine Translation with Keras
Stars: ✭ 501 (+225.32%)
Mutual labels:  attention-model
Awesome Attention Mechanism In Cv
A PyTorch implementation collection of attention modules and other plug-and-play modules used in computer vision
Stars: ✭ 54 (-64.94%)
Mutual labels:  attention-model
attention-mechanism-keras
attention mechanism in keras, like Dense and RNN...
Stars: ✭ 19 (-87.66%)
Mutual labels:  attention-model
Image Caption Generator
A neural network to generate captions for an image using CNN and RNN with BEAM Search.
Stars: ✭ 126 (-18.18%)
Mutual labels:  attention-model
Text Classification Pytorch
Text classification using deep learning models in Pytorch
Stars: ✭ 683 (+343.51%)
Mutual labels:  attention-model
Code
ECG Classification
Stars: ✭ 78 (-49.35%)
Mutual labels:  attention-model
Mtan
The implementation of "End-to-End Multi-Task Learning with Attention" [CVPR 2019].
Stars: ✭ 364 (+136.36%)
Mutual labels:  attention-model
Structured Self Attention
A Structured Self-attentive Sentence Embedding
Stars: ✭ 459 (+198.05%)
Mutual labels:  attention-model
Deepattention
Deep Visual Attention Prediction (TIP18)
Stars: ✭ 65 (-57.79%)
Mutual labels:  attention-model
Attention ocr.pytorch
This repository implements an encoder-decoder model with attention for OCR
Stars: ✭ 278 (+80.52%)
Mutual labels:  attention-model
Transformer image caption
Image Captioning based on Bottom-Up and Top-Down Attention model
Stars: ✭ 94 (-38.96%)
Mutual labels:  attention-model
Caver
Caver: a toolkit for multilabel text classification.
Stars: ✭ 38 (-75.32%)
Mutual labels:  attention-model
Sockeye
Sequence-to-sequence framework with a focus on Neural Machine Translation based on Apache MXNet
Stars: ✭ 990 (+542.86%)
Mutual labels:  attention-model
Bamnet
Code & data accompanying the NAACL 2019 paper "Bidirectional Attentive Memory Networks for Question Answering over Knowledge Bases"
Stars: ✭ 140 (-9.09%)
Mutual labels:  attention-model
Linear Attention Recurrent Neural Network
A recurrent attention module consisting of an LSTM cell which can query its own past cell states by the means of windowed multi-head attention. The formulas are derived from the BN-LSTM and the Transformer Network. The LARNN cell with attention can be easily used inside a loop on the cell state, just like any other RNN. (LARNN)
Stars: ✭ 119 (-22.73%)
Mutual labels:  attention-model
Pytorch Attention Guided Cyclegan
Pytorch implementation of Unsupervised Attention-guided Image-to-Image Translation.
Stars: ✭ 67 (-56.49%)
Mutual labels:  attention-model

SA-tensorflow

TensorFlow implementation of a soft-attention mechanism for video caption generation.

An example of the soft-attention mechanism: the attention weight alpha indicates the temporal attention within one video for each word.

[Yao et al. 2015, Describing Videos by Exploiting Temporal Structure] The original code, implemented in Torch, can be found here.

Prerequisites

  • Python 2.7
  • TensorFlow >= 0.7.1
  • NumPy
  • pandas
  • Keras
  • Java 1.8.0

Data

The MSVD [2] dataset can be downloaded from here.

We pack the data into HDF5 format, where each file is one training mini-batch with the following keys:

[u'data', u'fname', u'label', u'title']

batch['data'] stores the visual features. Shape: (n_step_lstm, batch_size, hidden_dim).

batch['fname'] stores the filenames (without extensions) of the videos. Shape: (batch_size).

batch['title'] stores the descriptions. If multiple sentences correspond to one video, the other fields (visual features, filenames, and labels) must be duplicated to keep a one-to-one mapping. Shape: (batch_size).

batch['label'] indicates where the video ends. For instance, [-1., -1., -1., -1., 0., -1., -1.] means that the video ends at index 4. Shape: (n_step_lstm, batch_size).
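For reference, here is a minimal sketch of inspecting one mini-batch with h5py (the filename train_batch_000.h5 is hypothetical; the shapes follow the key descriptions above):

```python
import h5py

# Open one mini-batch file (hypothetical name) and read its four datasets.
with h5py.File('train_batch_000.h5', 'r') as batch:
    data = batch['data'][:]    # visual features, shape (n_step_lstm, batch_size, hidden_dim)
    fname = batch['fname'][:]  # video filenames without extensions, shape (batch_size,)
    title = batch['title'][:]  # one description per (possibly duplicated) entry, shape (batch_size,)
    label = batch['label'][:]  # -1 everywhere except 0 where the video ends, shape (n_step_lstm, batch_size)
    print(data.shape, label.shape)
```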

Generate HDF5 data

We generate the HDF5 data by following the steps below. The code is a little messy; if you have any questions, feel free to ask.

1. Generate Label

Once you change the video_path and output_path, you can generate labels by running the script:

python hdf5_generator/generate_nolabel.py

I set the length of each clip to 10 frames and cap each video at a maximum of 450 frames. You can change these parameters in the function get_frame_list(frame_num).
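As a rough illustration of this clip-splitting step (an assumption-based sketch, not the actual implementation in generate_nolabel.py), get_frame_list might look like:

```python
def get_frame_list(frame_num, clip_len=10, max_frames=450):
    """Split frame indices into fixed-length clips, capping long videos.

    Illustrative sketch only; the real get_frame_list() may differ.
    """
    frame_num = min(frame_num, max_frames)
    # One sublist of frame indices per clip: [[0..9], [10..19], ...]
    return [list(range(start, min(start + clip_len, frame_num)))
            for start in range(0, frame_num, clip_len)]

print(get_frame_list(25))  # [[0, ..., 9], [10, ..., 19], [20, ..., 24]]
```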

2. Pack features together (no caption information)

Inputs:

label_path: The path for the labels generated earlier.

feature_path: The path that stores features such as VGG and C3D. You can name the feature directories whatever you want.

Outputs:

h5py_path: The path where the concatenated features are stored; the code automatically puts the features in the subdirectory cont.

python hdf5_generator/input_generator.py

Note that in the function get_feats_depend_on_label() you can choose between taking the mean feature of the frames in a clip or randomly sampling one frame's feature. The random-sampling code is commented out since its performance is worse.
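The two options amount to roughly the following (illustrative function and parameter names; the real logic lives in get_feats_depend_on_label()):

```python
import numpy as np

def pool_clip(clip_feats, mode='mean'):
    """Pool per-frame features of one clip; clip_feats has shape (clip_len, feat_dim)."""
    if mode == 'mean':
        # Default: average the frame features over the clip.
        return clip_feats.mean(axis=0)
    # Alternative (commented out upstream because it performed worse):
    # use a single randomly sampled frame's feature.
    return clip_feats[np.random.randint(len(clip_feats))]
```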

3. Add captions into HDF5 data

I set the maximum number of words in a caption to 35. The feature folder is where the final output features are stored.

python hdf5_generator/trans_video_youtube.py

(The code here was written by Kuo-Hao.)
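Conceptually, the 35-word cap truncates each caption before packing, roughly like this hypothetical helper (the actual preprocessing in trans_video_youtube.py may also tokenize and pad):

```python
MAX_WORDS = 35  # maximum caption length mentioned above

def cap_caption(caption, max_words=MAX_WORDS):
    # Keep at most max_words whitespace-separated tokens.
    return ' '.join(caption.strip().split()[:max_words])

print(cap_caption('a man is playing a guitar'))  # short captions pass through unchanged
```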

Generate data list

video_data_path_train = '$ROOTPATH/SA-tensorflow/examples/train_vn.txt'

You can change the path variable to the absolute path of your data. Then simply run python getlist.py to generate the list.

P.S. The filenames of the HDF5 data start with train, val, or test.
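As a hedged sketch of what the list generation amounts to, assuming the HDF5 batches live under a data/ directory with a .h5 extension and that examples/ already exists (only train_vn.txt is confirmed above; the val/test names are assumed by analogy):

```python
import glob
import os

root = os.path.abspath('data')  # assumed location of the HDF5 mini-batches
for split in ('train', 'val', 'test'):
    files = sorted(glob.glob(os.path.join(root, split + '*.h5')))
    # Write one absolute path per line, e.g. examples/train_vn.txt.
    with open(os.path.join('examples', split + '_vn.txt'), 'w') as f:
        f.write('\n'.join(files))
```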

Usage

Training

$ python Att.py --task train

Testing

Test the model after a certain number of training epochs.

$ python Att.py --task test --net models/model-20

Author

Tseng-Hung Chen

Kuo-Hao Zeng

Disclaimer

We modified the code from the repository jazzsaxmafia/video_to_sequence into this temporal-attention model.

References

[1] L. Yao, A. Torabi, K. Cho, N. Ballas, C. Pal, H. Larochelle, and A. Courville. Describing videos by exploiting temporal structure. arXiv:1502.08029v4, 2015.

[2] D. L. Chen and W. B. Dolan. Collecting Highly Parallel Data for Paraphrase Evaluation. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (ACL 2011), Portland, OR, June 2011.

[3] Microsoft COCO Caption Evaluation
