Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → AkariAsai → Logic_guided_qa

AkariAsai / Logic_guided_qa

Licence: mit

The official implementation of ACL 2020, "Logic-Guided Data Augmentation and Regularization for Consistent Question Answering".

Programming Languages

139335 projects - #7 most used programming language

Labels

question-answering

Projects that are alternatives of or similar to Logic guided qa

End-to-end neural table-text understanding models.

Stars: ✭ 583 (+960%)

Mutual labels: question-answering

Bert language understanding

Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN

Stars: ✭ 933 (+1596.36%)

Mutual labels: question-answering

Cnn Question Classification Keras

Chinese Question Classifier (Keras Implementation) on BQuLD

Stars: ✭ 28 (-49.09%)

Mutual labels: question-answering

😎 A curated list of the Question Answering (QA)

Stars: ✭ 596 (+983.64%)

Mutual labels: question-answering

Insuranceqa Corpus Zh

🚁 保险行业语料库，聊天机器人

Stars: ✭ 821 (+1392.73%)

Mutual labels: question-answering

Self-contained Machine Learning and Natural Language Processing library in Go

Stars: ✭ 854 (+1452.73%)

Mutual labels: question-answering

基于自然语言理解与机器学习的聊天机器人，支持多用户并发及自定义多轮对话

Stars: ✭ 516 (+838.18%)

Mutual labels: question-answering

🔎 Search the information available on a webpage using natural language instead of an exact string match.

Stars: ✭ 1,023 (+1760%)

Mutual labels: question-answering

Deep Embedded Memory Networks

https://arxiv.org/abs/1707.00836

Stars: ✭ 19 (-65.45%)

Mutual labels: question-answering

Zeronet Dev Center

A Development Center for the ZeroNet. Tutorials on ZeroNet Zite Development, Collaboration, and Questions

Stars: ✭ 21 (-61.82%)

Mutual labels: question-answering

基于多搜索引擎和深度学习技术的自动问答

Stars: ✭ 602 (+994.55%)

Mutual labels: question-answering

Nlp chinese corpus

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

Stars: ✭ 6,656 (+12001.82%)

Mutual labels: question-answering

Kg Demo For Movie

从无到有构建一个电影知识图谱，并基于该KG，开发一个简易的KBQA程序。

Stars: ✭ 876 (+1492.73%)

Mutual labels: question-answering

An open source library for deep learning end-to-end dialog systems and chatbots.

Stars: ✭ 5,525 (+9945.45%)

Mutual labels: question-answering

Code to reproduce results in our ACL 2018 paper "Did the Model Understand the Question?"

Stars: ✭ 31 (-43.64%)

Mutual labels: question-answering

Memn2n Babi Python

End-To-End Memory Networks for bAbI question-answering tasks

Stars: ✭ 570 (+936.36%)

Mutual labels: question-answering

Knowledge Graphs

A collection of research on knowledge graphs

Stars: ✭ 845 (+1436.36%)

Mutual labels: question-answering

⛔ [NOT MAINTAINED] A web-based annotator for closed-domain question answering datasets with SQuAD format.

Stars: ✭ 48 (-12.73%)

Mutual labels: question-answering

Conversational Ai

Conversational AI Reading Materials

Stars: ✭ 34 (-38.18%)

Mutual labels: question-answering

Keras Question And Answering Web Api

Question answering system developed using seq2seq and memory network model in Keras

Stars: ✭ 21 (-61.82%)

Mutual labels: question-answering

View All Similar Projects ➔

Logic-Guided Data Augmentation and Regularization for Consistent Question Answering

This is the original implementation of the following paper.

Akari Asai and Hannaneh Hajishirzi. Logic-Guided Data Augmentation and Regularization for Consistent Question Answering. In: Proceedings of ACL (short). 2020.

@inproceedings{asai2020logic,
  title={Logic-Guided Data Augmentation and Regularization for Consistent Question Answering},
  author={Asai, Akari and Hajishirzi, Hannaneh},
  booktitle={ACL},
  year={2020}
}

In the paper, we introduce logic-guided data augmentation and regularization to improve accuracy and consistency in a range of question answering datasets, namely WIQA, QuaRel and HotpotQA (comparison). This repository contains example codes for WIQA and HotpotQA.

Acknowledgements: To implement our RoBERTa-based baselines for WIQA and HotpotQA, we used the Hugging Face's transformers library. Huge thanks to the contributors of Hugging Face library! Niket helps us to reproduce the original baseline results from the WIQA paper.

Currently this repository contains example codes for WIQA. Codes for other datasets will be added soon.

0. Setup

Install python packages

Run the command below to install python packages.

pip -r requirements.txt

Download data

Run download.sh to download the original WIQA.

bash download.sh

1. Data augmentation

After download the data, run the command below.

WIQA

python wiqa_augmentation.py --data_dir PATH_TO_WIQA_DATA_DIR --output_dir PATH_TO_AUGMENTED_DATA

Optional argument:

optional arguments:
  --store_dev_test_augmented_data
                        set true to augment dev/test data. By default we keep eval data as is.
  --sample_ratio SAMPLE_RATIO
                        the random sample rate for original data to be used.
  --sample_ratio_augmentation SAMPLE_RATIO_AUGMENTATION
                        the random sample rate for augmented data to be added.
  --eval_mode           with eval mode, we only add the question included in
                        the original data.

2. Training

You can train our models from scratch or download the checkpoints we trained. The pre-trained weights can be downloaded from here.

WIQA

You can train our RoBERTa baseline, RoBERTa+DA, and RoBERTa + DA + Consistency models by running the commands below.

You can reduce the number of gradient_accumulation_steps if you use multiple GPUs. We set the per_gpu_train_batch_size to fit a single GPU with 11 GB GPU Memory, and you can increase the number.

Please refer the additional details of optional arguments in run_classification_consistency.py or run python run_classification_consistency.py -h.

RoBERTa (base) baseline model

python run_classification_consistency.py \
--model_type roberta \
--model_name_or_path roberta-base \
--task_name wiqa \
--do_train \
--data_dir PATH_TO_WIQA_DATA_DIR \
--max_seq_length 256 --per_gpu_eval_batch_size=8 \
--per_gpu_train_batch_size=8 \
--gradient_accumulation_steps=8 \
--learning_rate 2e-5 \
--weight_decay 0.01 \
--output_dir PATH_TO_WIQA_MODEL_OUTPUT_DIR \
--seed 789

RoBERTa + Data Augmentation model

python run_classification_consistency.py \
--model_type roberta \
--model_name_or_path roberta-base \
--task_name wiqa \
--do_train \
--data_dir PATH_TO_AUGMENTED_DATA \
--max_seq_length 256 --per_gpu_eval_batch_size=8 \
--per_gpu_train_batch_size=8 \
--gradient_accumulation_steps=8 \
--learning_rate 2e-5 \
--weight_decay 0.01 \
--output_dir PATH_TO_WIQA_MODEL_OUTPUT_DIR \
--seed 789

RoBERTa + Data Augmentation + Consistency model

To train consistency models, you first need to train a model without consistency loss (for the annealing discussed in Section 3 in our paper). You can either set the --lambda_a and --lambda_b to 0 for the first 3 epochs, or start from the checkpoint of the RoBERTa baseline trained for 3 epochs.

python run_classification_consistency.py \
--model_type roberta_cons \
--model_name_or_path PATH_TO_BASELINE_MODEL_CHECKPOINTS_DIR \
--task_name wiqa --do_train \
--data_dir PATH_TO_AUGMENTED_DATA \
--max_seq_length 256 --per_gpu_eval_batch_size=4 \
--output_dir PATH_TO_WIQA_MODEL_OUTPUT_DIR \
--per_gpu_train_batch_size=4 --gradient_accumulation_steps=16 \
--learning_rate 2e-5 --num_train_epochs 3 \
--weight_decay 0.01 \
--lambda_a 0.5 --lambda_b 0.1 \
--seed 789 \
--use_consistency

Note: We've noticed that models (BERT, RoBERTa) are sensitive to some hyperparameter on WIQA. We have conducted intensive hyperparameter search at the beginning of our project, and use the same hyperparameter performing best with our RoBERTa baseline model throughout our experiments. Several recent papers (e.g., Dodge et al. (2020), Bisk et al. (2020)) discuss those sensitivity. There might be performance variance with different number of the training batch size.

3. Evaluation

WIQA

Run the command below (same for the RoBERTa baseline, RoBERTa + DA andRoBERTa + Data Augmentation + Consistency model). To test the performance on eval data, you can simply replace the do_eval option with do_test.

python run_classification_consistency.py \
--model_type roberta \
--model_name_or_path PATH_TO_WIQA_MODEL_OUTPUT_DIR \
--task_name wiqa \
--do_eval \
--data_dir PATH_TO_WIQA_DATA_DIR \
--max_seq_length 256 --per_gpu_eval_batch_size=8 \
--weight_decay 0.01 \
--output_dir PATH_TO_WIQA_MODEL_OUTPUT_DIR \
--seed 789

4. Contact

Please contact Akari Asai (Twitter:@AkariAsai, Email: alari[at]cs.washington.edu) for questions and suggestions.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 55

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (0) 🔗