seanie12 / mrqa

Licence: Apache-2.0 license

Code for EMNLP-IJCNLP 2019 MRQA Workshop Paper: "Domain-agnostic Question-Answering with Adversarial Training"

Programming Languages

python

139335 projects - #7 most used programming language

shell

77523 projects

Projects that are alternatives of or similar to mrqa

explicit memory tracker

[ACL 2020] Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine Reading

Stars: ✭ 35 (+0%)

Mutual labels: question-answering

MLH-Quizzet

This is a smart Quiz Generator that generates a dynamic quiz from any uploaded text/PDF document using NLP. This can be used for self-analysis, question paper generation, and evaluation, thus reducing human effort.

Stars: ✭ 23 (-34.29%)

Mutual labels: question-answering

PersianQA

Persian (Farsi) Question Answering Dataset (+ Models)

Stars: ✭ 114 (+225.71%)

Mutual labels: question-answering

extractive rc by runtime mt

Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"

Stars: ✭ 36 (+2.86%)

Mutual labels: question-answering

ODSQA

ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET

Stars: ✭ 43 (+22.86%)

Mutual labels: question-answering

iPerceive

Stars: ✭ 52 (+48.57%)

Mutual labels: question-answering

verseagility

Ramp up your custom natural language processing (NLP) task, allowing you to bring your own data, use your preferred frameworks and bring models into production.

Stars: ✭ 23 (-34.29%)

Mutual labels: question-answering

PororoQA

PororoQA, https://arxiv.org/abs/1707.00836

Stars: ✭ 26 (-25.71%)

Mutual labels: question-answering

question-answering

No description or website provided.

Stars: ✭ 32 (-8.57%)

Mutual labels: question-answering

denspi

Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index (DenSPI)

Stars: ✭ 188 (+437.14%)

Mutual labels: question-answering

QA HRDE LTC

TensorFlow implementation of "Learning to Rank Question-Answer Pairs using Hierarchical Recurrent Encoder with Latent Topic Clustering," NAACL-18

Stars: ✭ 29 (-17.14%)

Mutual labels: question-answering

NCE-CNN-Torch

Noise-Contrastive Estimation for Question Answering with Convolutional Neural Networks (Rao et al. CIKM 2016)

Stars: ✭ 54 (+54.29%)

Mutual labels: question-answering

GAR

Code and resources for papers "Generation-Augmented Retrieval for Open-Domain Question Answering" and "Reader-Guided Passage Reranking for Open-Domain Question Answering", ACL 2021

Stars: ✭ 38 (+8.57%)

Mutual labels: question-answering

gated-attention-reader

Tensorflow/Pytorch implementation of Gated Attention Reader

Stars: ✭ 37 (+5.71%)

Mutual labels: question-answering

finance-qa-spider

金融问答平台文本数据采集/爬取，数据源涉及上交所，深交所，全景网及新浪股吧

Stars: ✭ 33 (-5.71%)

Mutual labels: question-answering

ssdg-benchmark

Benchmarks for semi-supervised domain generalization.

Stars: ✭ 46 (+31.43%)

Mutual labels: domain-generalization

patrick-wechat

⭐️🐟 questionnaire wechat app built with taro, taro-ui and heart. 微信问卷小程序

Stars: ✭ 74 (+111.43%)

Mutual labels: question-answering

SQUAD2.Q-Augmented-Dataset

Augmented version of SQUAD 2.0 for Questions

Stars: ✭ 31 (-11.43%)

Mutual labels: question-answering

deformer

[ACL 2020] DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering

Stars: ✭ 111 (+217.14%)

Mutual labels: question-answering

QA4IE

Original implementation of QA4IE

Stars: ✭ 24 (-31.43%)

Mutual labels: question-answering

View All Similar Projects ➔

Domain-agnostic Question-Answering with Adversarial Training

Code for our paper "Domain-agnostic Question-Answering with Adversarial Training" which is accepted by EMNLP-IJCNLP 2019 MRQA Workshop.

Model Architecture

Data Preparation

Download the original data

Download the data by running shell file.
Then run the code. Preprocessed train data will be created before training (It will takes quite a long time)

$ cd data
$ ./download_data.sh

(Optional) Download the pickled data (for fast data loading)

Download the pickled data from this link.
Unzip the zipfile on the root directory.

.
├── ...
├── pickled_data_bert-base-uncased_False
│   ├── HotpotQA.pkl
│   ├── NaturalQuestions.pkl
│   ├── NewsQA.pkl
│   ├── SQuAD.pkl
│   ├── SearchQA.pkl
│   └── TriviaQA.pkl
└── ...

Arguments should be same as below if you use pickled data. If you want to change one of these two arguments.

parser.add_argument("--bert_model", default="bert-base-uncased", type=str, help="Bert model")
parser.add_argument("--skip_no_ans", action='store_true', default=False, help="whether to exclude no answer example")

Requirements

Please install the following library requirements specified in the requirements.txt first.

torch==1.1.0
pytorch-pretrained-bert>=0.6.2
json-lines>=0.5.0

Model Training & Validation

$ python3 main.py \
         --epochs 2 \
         --batch_size 64 \
         --lr 3e-5 \
         --do_lower_case \
         --use_cuda \
         --do_valid \
         --adv \
         --dis_lambda 0.01

If you are using uncased bert model, give the option --do_lower_case.
If you want to do validation, give the option --do_valid.

Reference

@inproceedings{lee-etal-2019-domain,
    title={Domain-agnostic Question-Answering with Adversarial Training},
    author={Seanie Lee and Donggyu Kim and Jangwon Park},
    booktitle={Proceedings of the 2nd Workshop on Machine Reading for Question Answering},
    publisher={Association for Computational Linguistics},
    year={2019},
    url={https://www.aclweb.org/anthology/D19-5826},
}

Contributors

Lee, Seanie (https://seanie12.github.io/)
Kim, Donggyu (https://github.com/donggyukimc)
Park, Jangwon (https://github.com/monologg)

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

seanie12 / mrqa

Programming Languages

Labels

Projects that are alternatives of or similar to mrqa

Domain-agnostic Question-Answering with Adversarial Training

Model Architecture

Data Preparation

Download the original data

(Optional) Download the pickled data (for fast data loading)

Requirements

Model Training & Validation

Reference

Contributors