m-bain / CondensedMovies

Licence: other
Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]

CondensedMovies

N.B.: Please use the Condensed Movies Challenge repository (https://github.com/m-bain/CondensedMovies-chall), which has updated splits, because some videos from the original paper are no longer available and their features are missing.

Contact me directly for the raw videos. Details on downloading the features are in the challenge repo.

###############################################

This repository contains the video dataset, implementation and baselines from Condensed Movies: Story Based Retrieval with Contextual Embeddings.

Project page | arXiv preprint | Read the paper | Preview the data


CondensedMovies Dataset

The dataset consists of 3K+ movies, 30K+ professionally captioned clips, 1K+ video hours, 400K+ facetracks & precomputed features from 6 different modalities.

Installation

Requirements:

  • Storage
    • 20GB for features (required for baseline experiments)
    • 10GB for facetracks
    • 250GB for videos
  • Libraries
    • ffmpeg (for video download)
    • youtube-dl (for video download)
    • pandas, numpy
    • python 3.6+
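
The requirements above can be checked before starting a download. The following is a minimal sketch, not part of the repository: the helper names (`missing_tools`, `required_gb`, `has_enough_space`) are hypothetical, and the storage figures are simply copied from the list above.

```python
import shutil
import sys

# Tools named in the requirements list; both are only needed for video download.
REQUIRED_TOOLS = ("ffmpeg", "youtube-dl")

# Approximate storage needs in GB, copied from the requirements list.
STORAGE_GB = {"features": 20, "facetracks": 10, "videos": 250}


def missing_tools(tools=REQUIRED_TOOLS):
    """Return the tools that cannot be found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


def required_gb(subsets):
    """Sum the storage needed for the chosen subsets."""
    return sum(STORAGE_GB[name] for name in subsets)


def has_enough_space(path, subsets):
    """Check whether `path` has enough free space for the chosen subsets."""
    free_gb = shutil.disk_usage(path).free / 1e9
    return free_gb >= required_gb(subsets)


if __name__ == "__main__":
    subsets = ["features"]  # enough for the baseline experiments
    print("Python 3.6+:", sys.version_info >= (3, 6))
    print("Missing tools:", missing_tools() or "none")
    print("Enough space here:", has_enough_space(".", subsets))
```

Pass the directory you intend to download into as `path`, since free space is reported per filesystem.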

Prepare Data

  1. Navigate to the data preparation directory: cd CondensedMovies/data_prep/
  2. Edit the configuration file config.json to select which subsets of the dataset to download and where to save them.
  3. If downloading the source videos (src: true), you can edit youtube-dl.conf to set the desired resolution, subtitles, etc. See the youtube-dl documentation for more info.
  4. Run python download.py
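
Step 2 can also be scripted. The sketch below is an illustration only: it assumes `src` is a top-level boolean key in config.json (the README only shows `src: true`, not the full layout), and `update_config` is a hypothetical helper, not something the repository provides.

```python
import json
from pathlib import Path


def update_config(path, **overrides):
    """Load a JSON config, apply top-level overrides, and write it back."""
    path = Path(path)
    config = json.loads(path.read_text())
    config.update(overrides)
    path.write_text(json.dumps(config, indent=2) + "\n")
    return config


if __name__ == "__main__":
    # Hypothetical usage: skip the source videos (about 250GB).
    # Assumes `src` is a top-level key; check config.json before relying on this.
    cfg = Path("config.json")
    if cfg.exists():
        print(update_config(cfg, src=False))
```

If the real file nests its options, edit it by hand instead; the sketch deliberately touches only top-level keys.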

If you have trouble downloading the source videos or features (due to geographical restrictions or otherwise), please contact me.

Video-Text Retrieval

Baseline (Mixture of Expert Embeddings)

Edit data_dir and save_dir in configs/moe.json for the experiments.

  1. python train.py configs/moe.json
  2. python test.py --resume $SAVED_EXP_DIR/model_best.pth
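
The two steps above can be assembled programmatically, which makes the relationship between the saved experiment directory and the checkpoint path explicit. This is a sketch under assumptions: `baseline_commands` is a hypothetical helper, and the caller must already know where train.py wrote its output (the `$SAVED_EXP_DIR` above).

```python
import subprocess
import sys
from pathlib import Path


def baseline_commands(saved_exp_dir, config="configs/moe.json"):
    """Build the train and test commands from the two steps above."""
    checkpoint = Path(saved_exp_dir) / "model_best.pth"
    train = [sys.executable, "train.py", config]
    test = [sys.executable, "test.py", "--resume", str(checkpoint)]
    return train, test


def run_test(saved_exp_dir):
    """Evaluate the best checkpoint; raises if test.py exits with an error."""
    _, test = baseline_commands(saved_exp_dir)
    subprocess.run(test, check=True)
```

Only `run_test` launches a process; `baseline_commands` just builds argument lists, so it can be inspected or logged first.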

Visualisation

Run python visualise_face_tracks.py with the appropriate arguments to visualise face tracks for a given videoID (requires facetracks and source videos downloaded).

TODO:

  • youtube download script
  • missing videos check
  • precomputed features download script
  • facetrack visualisation
  • dataloader
  • video-text retrieval baselines
  • intra-movie baselines + char module
  • release fixed_seg features

FAQ

Why did some of the source videos fail to download?

This is most likely due to geographical restrictions on the videos. Email me at [email protected] and I can help.

The precomputed features are averaged over the temporal dimension. Will you release the original features?

The averaging is there to save space: the original features total ~1TB. Contact me to arrange a download.
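
To illustrate what averaging over the temporal dimension means, here is a minimal sketch. It assumes a clip's features are a sequence of T vectors of equal length D; the actual layout of the released features is not described here, and `temporal_mean` is a hypothetical name.

```python
def temporal_mean(features):
    """Average a clip's per-timestep feature vectors into one vector.

    `features` is a list of T vectors, each of length D. The result has
    length D, so storage shrinks by a factor of T.
    """
    if not features:
        raise ValueError("need at least one timestep")
    steps = len(features)
    # zip(*features) groups the values of each feature dimension across time.
    return [sum(column) / steps for column in zip(*features)]


# Toy example: T=3 timesteps, D=2 feature dimensions.
clip = [[1.0, 4.0], [2.0, 5.0], [3.0, 6.0]]
print(temporal_mean(clip))  # [2.0, 5.0]
```

With numpy arrays of shape (T, D), the same operation is `features.mean(axis=0)`.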

I think clip X is incorrectly identified as being from movie Y. What should I do?

Please let me know of any movie identification mistakes and I'll correct them ASAP.

Acknowledgements

We would like to thank Samuel Albanie for his help with feature extraction.
