All Projects → facebookresearch → QA-Overlap

facebookresearch / QA-Overlap

Licence: other
Code to support the paper "Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets"

Programming Languages

python
139335 projects - #7 most used programming language

QA-Overlap

This repository contains code to support the research paper Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets


Facebook AI Research and UCL NLP


This repository contains both links to download annotations, and an evaluation script to reproduce our evaluation methods.

Data downloads

Data can be dowloaded by running the download.py script, or to download the annotations manually, you can download the links here:

The annoations are in jsonl format. We recommend using them with our provided evaluation scripts

Evaluating your model's Test predictions with our splits

To evaluate your model, first download the data using the download script

pip install -r requirements
python download.py

Then, you can get scores for your models for all QA subsets by running evaluate.py

$ python evaluate.py -h
usage: evaluate.py [-h] [--predictions PREDICTIONS]
                   [--dataset_name {naturalquestions,triviaqa,webquestions}]

optional arguments:
  -h, --help            show this help message and exit
  --predictions PREDICTIONS
                        path to predictions txt file, one answer per line.
                        Answer order should follow the order in
                        data/{dataset}-test.qa.csv
  --dataset_name {naturalquestions,triviaqa,webquestions}
                        name of datset to evaluate on

You will see an output like this:

$ python evaluate.py --predictions rag_nq_predictions.txt --dataset_name nq
--------------------------------------------------
Label       : total
N examples  :  3610
Exact Match :  44.48753462603878
--------------------------------------------------
Label       : question_overlap
N examples  :  324
Exact Match :  70.67901234567901
--------------------------------------------------
Label       : no_question_overlap
N examples  :  672
Exact Match :  31.101190476190474
--------------------------------------------------
Label       : answer_overlap
N examples  :  2297
Exact Match :  55.72485851110144
--------------------------------------------------
Label       : no_answer_overlap
N examples  :  1313
Exact Match :  24.828636709824828
--------------------------------------------------
Label       : answer_overlap_only
N examples  :  315
Exact Match :  34.92063492063492

License

The code in this repository is licenced according the LICENSE file.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].