All Categories → No Category → captioning

Top 8 captioning open source projects

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

✭ 4,713

python deep-learning pytorch dialog pretrained-models vqa captioning multimodal multi-tasking textvqa hateful-memes

A Pytorch implementation of Attention on Attention module (both self and guided variants), for Visual Question Answering

✭ 33

python vqa attention attention-mechanism captioning visual-question-answering

show-attend-and-tell

No description or website provided.

✭ 25

Jupyter Notebook python captioning

Applying Common-Sense Reasoning to Multi-Modal Dense Video Captioning and Video Question Answering | Python3 | PyTorch | CNNs | Causality | Reasoning | LSTMs | Transformers | Multi-Head Self Attention | Published in IEEE Winter Conference on Applications of Computer Vision (WACV) 2021

[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)

✭ 41

python shell nlp video vision srl captioning captioning-videos vision-and-language grounding video-language event-relations semantic-roles

What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment [CVPR 2019]

✭ 38

python shell pytorch video-processing lstm representation-learning action-recognition video-understanding c3d video-captioning captioning fine-grained-classification multitask-learning dilated-convolution action-quality-assessment mtl-aqa fine-grained-action-recognition dilated-c3d

ebu-tt-live-toolkit

Toolkit for supporting the EBU-TT Live specification

✭ 23

python Gherkin javascript HTML CSS Makefile Batchfile video live captions subtitles broadcast ebu-tt subtitling captioning

S2VT-seq2seq-video-captioning-attention

S2VT (seq2seq) video captioning with bahdanau & luong attention implementation in Tensorflow

✭ 18

python shell video deep-learning tensorflow seq2seq attention-mechanism captioning

1-8 of 8 captioning projects