All Projects → dialogtekgeek → AVSD-DSTC10_Official

dialogtekgeek / AVSD-DSTC10_Official

Licence: MIT license
Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC)

Projects that are alternatives of or similar to AVSD-DSTC10 Official

Qrn
Query-Reduction Networks (QRN)
Stars: ✭ 137 (+522.73%)
Mutual labels:  qa, dialog
dialogbot
dialogbot, provide search-based dialogue, task-based dialogue and generative dialogue model. 对话机器人,基于问答型对话、任务型对话、聊天型对话等模型实现,支持网络检索问答,领域知识问答,任务引导问答,闲聊问答,开箱即用。
Stars: ✭ 96 (+336.36%)
Mutual labels:  qa, dialog
AudioVisualSceneAwareDialog
No description or website provided.
Stars: ✭ 22 (+0%)
Mutual labels:  dialog, audio-visual
panter-dialog
Panter Dialog is an stylish android library that helps users add cool features like adding header and header logos
Stars: ✭ 116 (+427.27%)
Mutual labels:  dialog
denbun
Adjust showing frequency of Android app messages, and to be more user friendly 🐦
Stars: ✭ 17 (-22.73%)
Mutual labels:  dialog
relint
General purpose RegEx based file linter.
Stars: ✭ 33 (+50%)
Mutual labels:  qa
permuted-bAbI-dialog-tasks
Dataset for 'Learning End-to-End Goal-Oriented Dialog with Multiple Answers' EMNLP 2018
Stars: ✭ 17 (-22.73%)
Mutual labels:  dialog
ExploratoryTestingChromeExtension
Exploratory Testing Chrome Extension
Stars: ✭ 26 (+18.18%)
Mutual labels:  qa
opensnips
Open source projects related to Snips https://snips.ai/.
Stars: ✭ 50 (+127.27%)
Mutual labels:  dialog
plain-modal
The simple library for customizable modal window.
Stars: ✭ 21 (-4.55%)
Mutual labels:  dialog
Android-Shortify
An Android library used for making an Android application more faster with less amount of code. Shortify for Android provides basic functionalities of view and resource binding, view customization, JSON parsing, AJAX, various readymade dialogs and much more.
Stars: ✭ 21 (-4.55%)
Mutual labels:  dialog
smart-show
Toast # Snackbar # Dialog
Stars: ✭ 500 (+2172.73%)
Mutual labels:  dialog
react-native-wxui
A UI package for React Native
Stars: ✭ 21 (-4.55%)
Mutual labels:  dialog
HijriDatePicker
Material (Gregorian - Hijri) Date & Time Picker
Stars: ✭ 128 (+481.82%)
Mutual labels:  dialog
Licenser
An android library to display the licenses of your application libraries in a easy way.
Stars: ✭ 75 (+240.91%)
Mutual labels:  dialog
php-qa-tools
A metapackage for all PHP quality assurance tools we use at Dealerdirect
Stars: ✭ 36 (+63.64%)
Mutual labels:  qa
ngx-modal
Dynamic modal dialog for Angular
Stars: ✭ 54 (+145.45%)
Mutual labels:  dialog
bootstrap-cookie-consent-settings
A modal dialog (cookie banner) and framework to handle the German and EU law about cookies in a website. Needs Bootstrap.
Stars: ✭ 25 (+13.64%)
Mutual labels:  dialog
DSTC6-End-to-End-Conversation-Modeling
DSTC6: End-to-End Conversation Modeling Track
Stars: ✭ 56 (+154.55%)
Mutual labels:  dialog
TheBashMenu
A useful bash script allowing you to easily create your own menu, which uses the directional keys! Quickly add your title, options and commands and you're good to go!
Stars: ✭ 52 (+136.36%)
Mutual labels:  dialog

AVSD-DSTC10 Official

Audio Visual Scene-Aware Dialog (AVSD) Challenge at the 10th Dialog System Technology Challenge (DSTC) https://sites.google.com/dstc.community/dstc10

News

System submission site is now open:

https://docs.google.com/forms/d/e/1FAIpQLSe6FNXNhpb-VjiILamx6EjR_ducdRLgnP9keh2fa-Q8WvrcJQ/viewform
To upload your files, please use your google account.

The baseline system and the validation data for reasoning timing is now releasing:

After register DSTC10 and then get the baseline system from HERE

Please Register HERE for participating in the DSTC10 challenge

Goal: Reasoning for Audio Visual Scene-Aware Dialog

The task setup for the previous challenges in DSTC7 and DSTC8 allowed the participants 
to use human-created video captions to generate answers for the dialog questions. 
However, such manual descriptions are not available in real-world applications, where the 
system needs to learn to produce the answers without the captions. 
To encourage progress towards this end, we propose a third challenge
in DSTC10 under the video-based scene-aware dialog track. 
In this challenge, we seek evidence from the system to support the generated answer 
via detecting the temporal segments in the videos corresponding to the answer.

Schedule

June 14th, 2021: Answer generation data release
June 30th, 2021: Answer reasoning temporal localization data and baseline release: 
   **Releasing  AVSD@DSTC10 registrants only**

September 13th, 2021: Test Data release
September 21st 2021: Test Submission due -> 28th

November 1st 2021: Challenge paper submission due
January or February, 2022: Workshop

Task definition

Task 1: Video QA dialog

Goal: Answer generation without using manual descriptions for inference
You can train models using manual descriptions but CANNOT use them for testing. 
Video description capability needs to be embedded within the answer generation models.

Data conditions:
a. Use the provided video including audio and video features and localization information with answer reasoning, 
   the dialog history and manual video descriptions (scripts and summary) data for training for the Closed condition
    - text descriptions or scripts used by the actor to enact in the videos
    - summary generated by the questioners after holding 10 QAs.

b. Publicly available external data and pre-trained models may also be used for training as a sub task for the Open condition

Task 2: Reasoning Video QA dialog

Goal: Answer reasoning temporal Localization 

To support answers, evidence is required to be shown without using manual descriptions. 
For example, When a system generated answer is “A dog is barking.”, the sound of the dog’s barking and
the dog must be grounded in the video as evidence.
The localization of audiovisual evidence is required for each generated answer.
To train reasoning localization, begin and end timing of the grounding/evidence will be additionally provided for the training data.

Data conditions:
Temporal localization information for answer reasoning is provided as begin and end timestamps showing evidence scenes
a. Use the provided audio and video features with localization information with answer reasoning only for the Closed condition
b. Any publicly available data and pre-trained models may also be used for training as a sub task for the Open condition

Evaluation

Likert scale by 5 humans + Similarity compared with single and multiple ground truths
Intersection over Union (IoU) by comparing with “single” Evidence timing 

Data Collection Method for Reasoning

Data Collection for reasoning

Baseline system

The baseline system is based on Audio Visual Transformer for dialogue response sentence generation.
Information on gaining access to baseline is HERE (Registration is required.)

Output: 
Answer generation considering dialog context 
Evidence timing detection based on attention weights 

Evaluation: 
Validation data (1,787) was evaluated using “single” Answer and Evidence timing 
Sentence similarity: BLEU, METEOR, CIDEr 
Timing overlap: Intersection over Union (IoU)  
Official evaluation: 
  - Likert scale by 5 humans 
  - Similarity compared with single and multiple ground truths 

Additional Data:  
Evidence timing for Training data (7,659) will be provided soon. 

- Contact Information

Join the DSTC mailing list to get the latest updates about DSTC10

DSTC10 Task4 specific inquiries

AVSD@DSTC10 Organizer: Chiori Hori & Ankit Shah

Task Organizers

Ankit Shah, Shijie Geng, Peng Gao, Anoop Cherian, Chiori Hori and Tim K. Marks

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].