1. Clipbert[CVPR 2021 Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning for image-text and video-text tasks.
2. Tvqa[EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering
3. Recurrent Transformer[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
4. AnimeganA simple PyTorch Implementation of Generative Adversarial Networks, focusing on anime face drawing.
6. TVQAplus[ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering
8. TVCaption[ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset
9. moment detr[NeurIPS 2021] Moment-DETR code and QVHighlights dataset