Alternatives and detailed information of CondensedMovies

m-bain / CondensedMovies

Licence: other

Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]

Programming Languages

python


Projects that are alternatives of or similar to CondensedMovies

OpenDialog

An Open-Source Package for Chinese Open-domain Conversational Chatbot (Chinese chit-chat dialogue system; one-click deployment of a WeChat chit-chat bot)

Stars: ✭ 94 (+20.51%)

Mutual labels: retrieval

shrec17

Supplementary code for SHREC 2017 RGB-D Object-to-CAD Retrieval track

Stars: ✭ 27 (-65.38%)

Mutual labels: retrieval

MHCLN

Deep Metric and Hash Code Learning Network for Content Based Retrieval of Remote Sensing Images

Stars: ✭ 30 (-61.54%)

Mutual labels: retrieval

Lucenenet

Apache Lucene.NET

Stars: ✭ 1,704 (+2084.62%)

Mutual labels: retrieval

plexus

Plexus - Interactive Emotion Visualization based on Social Media

Stars: ✭ 27 (-65.38%)

Mutual labels: retrieval

CVPR2020 PADS

(CVPR 2020) This repo contains code for "PADS: Policy-Adapted Sampling for Visual Similarity Learning", which proposes learnable triplet mining with Reinforcement Learning.

Stars: ✭ 57 (-26.92%)

Mutual labels: retrieval

vitrivr-ng

vitrivr NG is a web-based user interface for searching and browsing mixed multimedia collections. It uses cineast as a backend

Stars: ✭ 14 (-82.05%)

Mutual labels: retrieval

cottontaildb

Cottontail DB is a column store aimed at multimedia retrieval. It allows for classical boolean as well as vector-space retrieval (nearest neighbour search) used in similarity search using a unified data and query model.

Stars: ✭ 16 (-79.49%)

Mutual labels: retrieval

awesome-visual-localization-papers

The relocalization task aims to estimate the 6-DoF pose of a novel (unseen) frame in the coordinate system given by the prior model of the world.

Stars: ✭ 60 (-23.08%)

Mutual labels: retrieval

UDLF

An Unsupervised Distance Learning Framework for Multimedia Retrieval

Stars: ✭ 40 (-48.72%)

Mutual labels: retrieval

palladian

Palladian is a Java-based toolkit with functionality for text processing, classification, information extraction, and data retrieval from the Web.

Stars: ✭ 32 (-58.97%)

Mutual labels: retrieval

image embeddings

Using efficientnet to provide embeddings for retrieval

Stars: ✭ 107 (+37.18%)

Mutual labels: retrieval

tf retrieval baseline

A Tensorflow retrieval (space embedding) baseline. Metric learning baseline on CUB and Stanford Online Products.

Stars: ✭ 39 (-50%)

Mutual labels: retrieval

deep recommenders

Deep Recommenders

Stars: ✭ 214 (+174.36%)

Mutual labels: retrieval

MoTIS

Mobile(iOS) Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP). Accepted at NAACL 2022.

Stars: ✭ 60 (-23.08%)

Mutual labels: retrieval

COIL

NAACL2021 - COIL Contextualized Lexical Retriever

Stars: ✭ 86 (+10.26%)

Mutual labels: retrieval

cineast

Cineast is a multi-feature content-based multimedia retrieval engine. It is capable of retrieving images, audio and video sequences, as well as 3D models, based on edge or color sketches, textual descriptions and example objects.

Stars: ✭ 51 (-34.62%)

Mutual labels: retrieval

AudioAlign

Audio Synchronization and Analysis Tool

Stars: ✭ 80 (+2.56%)

Mutual labels: retrieval

FieldedSDM

Fielded Sequential Dependence Model (code and runs)

Stars: ✭ 32 (-58.97%)

Mutual labels: retrieval

RETRO-pytorch

Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch

Stars: ✭ 473 (+506.41%)

Mutual labels: retrieval


CondensedMovies

N.B.: Please use the Condensed Movies challenge repository (https://github.com/m-bain/CondensedMovies-chall), which has updated splits, because some videos from the original paper are no longer available and their features are missing.

Contact me directly for the raw videos. Details on downloading the features are in the challenge repo.

###############################################

This repository contains the video dataset, implementation and baselines from Condensed Movies: Story Based Retrieval with Contextual Embeddings.

Project page | arXiv preprint | Read the paper | Preview the data

CondensedMovies Dataset

The dataset consists of 3K+ movies, 30K+ professionally captioned clips, 1K+ video hours, 400K+ facetracks & precomputed features from 6 different modalities.

Installation

Requirements:

Storage
- 20GB for features (required for baseline experiments)
- 10GB for facetracks
- 250GB for videos
Libraries
- ffmpeg (for video download)
- youtube-dl (for video download)
- pandas, numpy
- python 3.6+
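
Before starting a download, it can help to check the requirements above on the target machine. The following is a minimal sketch, not part of the repository: the storage figures and tool names are copied from the list above, while the function names (required_gb, missing_tools, has_space) are hypothetical helpers introduced only for illustration.

```python
import shutil

# Storage figures (in GB) copied from the requirements list above.
STORAGE_GB = {"features": 20, "facetracks": 10, "videos": 250}
# Command-line tools the requirements list names for video download.
TOOLS = ("ffmpeg", "youtube-dl")


def required_gb(subsets):
    """Total storage in GB needed for the chosen subsets."""
    return sum(STORAGE_GB[name] for name in subsets)


def missing_tools(tools=TOOLS):
    """Return the tools that cannot be found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


def has_space(path, subsets):
    """True if the filesystem holding `path` has enough free space."""
    free_gb = shutil.disk_usage(path).free / 1e9
    return free_gb >= required_gb(subsets)


if __name__ == "__main__":
    wanted = ["features", "facetracks"]  # example choice, not a default
    print("Need about", required_gb(wanted), "GB")
    print("Missing tools:", missing_tools() or "none")
    print("Enough space here:", has_space(".", wanted))
```

The check is deliberately coarse: it compares free space on the filesystem that holds the given path against the sum of the listed sizes and does not account for temporary files created during download.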

Prepare Data

1. Navigate to the data preparation directory: cd CondensedMovies/data_prep/
2. Edit the configuration file config.json to choose which subsets of the dataset to download and where to save them.
3. If downloading the source videos (src: true), you can edit youtube-dl.conf to set the desired resolution, subtitles, etc. See the youtube-dl documentation for more information.
4. Run python download.py

If you have trouble downloading the source videos or features (due to geographical restrictions or otherwise), please contact me.

Video-Text Retrieval

Baseline (Mixture of Expert Embeddings)

Edit data_dir and save_dir in configs/moe.json for the experiments.

python train.py configs/moe.json
python test.py --resume $SAVED_EXP_DIR/model_best.pth
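
The config edit mentioned above can also be scripted. The sketch below assumes only what the text states, namely that configs/moe.json is a JSON file containing keys named data_dir and save_dir. Where those keys sit inside the file is not documented here, so the hypothetical helper set_keys searches the whole structure recursively. Both function names and the example paths are illustrative, not part of the repository.

```python
import json


def set_keys(node, updates):
    """Recursively overwrite every key in `node` that appears in `updates`.

    Returns the number of values that were replaced.
    """
    changed = 0
    if isinstance(node, dict):
        for key, value in node.items():
            if key in updates:
                node[key] = updates[key]
                changed += 1
            else:
                changed += set_keys(value, updates)
    elif isinstance(node, list):
        for item in node:
            changed += set_keys(item, updates)
    return changed


def update_config(path, data_dir, save_dir):
    """Rewrite data_dir and save_dir in the JSON config at `path`."""
    with open(path) as f:
        config = json.load(f)
    n = set_keys(config, {"data_dir": data_dir, "save_dir": save_dir})
    if n == 0:
        # Assumption violated: the key names differ from those in the text.
        raise KeyError("no data_dir or save_dir key found in " + path)
    with open(path, "w") as f:
        json.dump(config, f, indent=4)
    return n
```

A call such as update_config("configs/moe.json", "/path/to/data", "/path/to/experiments") would then replace both values in place; the two directory paths are placeholders. Note that rewriting the file with json.dump may change its indentation and key formatting.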

Visualisation

Run python visualise_face_tracks.py with the appropriate arguments to visualise face tracks for a given videoID (requires facetracks and source videos downloaded).

TODO:

FAQ

Why did some of the source videos fail to download?

This is most likely due to geographical restrictions on the videos. Email me at [email protected] and I can help.

The precomputed features are averaged over the temporal dimension. Will you release the original features?

The averaging is done to save space: the original features total ~1TB. Contact me to arrange a download of them.
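
To make the averaging concrete, here is a minimal sketch of pooling over the temporal dimension with NumPy. It is an illustration under assumptions, not the extraction code used for the dataset: the (num_frames, feature_dim) layout, the function name average_over_time, and the toy numbers are all made up for the example.

```python
import numpy as np


def average_over_time(features):
    """Collapse a (num_frames, feature_dim) array to a (feature_dim,) vector."""
    features = np.asarray(features, dtype=np.float32)
    if features.ndim != 2:
        raise ValueError("expected a 2-D array of shape (num_frames, feature_dim)")
    # Mean over axis 0, i.e. over the time steps.
    return features.mean(axis=0)


# Toy example: 4 time steps, 3 feature dimensions (made-up numbers).
clip = np.array([[1.0, 2.0, 3.0],
                 [3.0, 2.0, 1.0],
                 [1.0, 2.0, 3.0],
                 [3.0, 2.0, 1.0]])
pooled = average_over_time(clip)
print(pooled.shape)  # (3,)
print(pooled)        # [2. 2. 2.]
```

Under this layout a clip with T time steps is stored as a single vector, so storage shrinks by roughly a factor of T, at the cost of discarding the temporal order.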

I think clip X is incorrectly identified as being from movie Y. What should I do?

Please let me know about any movie identification mistakes and I'll correct them ASAP.

Acknowledgements

We would like to thank Samuel Albanie for his help with feature extraction.

