Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → mcQA-suite → mcQA

mcQA-suite / mcQA

Licence: Apache-2.0 License

🔮 Answering multiple choice questions with Language Models.

Programming Languages

139335 projects - #7 most used programming language

Labels

nlp machine-learning natural-language-processing deep-learning pytorch artificial-intelligence question-answering bert

Projects that are alternatives of or similar to mcQA

🔍 Haystack is an open source NLP framework that leverages Transformer models. It enables developers to implement production-ready neural search, question answering, semantic document search and summarization for a wide range of applications.

Stars: ✭ 3,409 (+14721.74%)

Mutual labels: question-answering, bert

吹逼我们是认真的

Stars: ✭ 45 (+95.65%)

Mutual labels: question-answering, bert

DrFAQ is a plug-and-play question answering NLP chatbot that can be generally applied to any organisation's text corpora.

Stars: ✭ 29 (+26.09%)

Mutual labels: question-answering, bert

Financial Domain Question Answering with pre-trained BERT Language Model

Stars: ✭ 70 (+204.35%)

Mutual labels: question-answering, bert

BERT-for-Chinese-Question-Answering

No description or website provided.

Stars: ✭ 75 (+226.09%)

Mutual labels: question-answering, bert

Nlp chinese corpus

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

Stars: ✭ 6,656 (+28839.13%)

Mutual labels: question-answering, bert

A Sentence Cloze Dataset for Chinese Machine Reading Comprehension (CMRC 2019)

Stars: ✭ 118 (+413.04%)

Mutual labels: question-answering, bert

中文wiki百科QA阅读理解问答系统，使用了CCKS2016数据的NER模型和CMRC2018的阅读理解模型，还有W2V词向量搜索,使用torchserve部署

Stars: ✭ 46 (+100%)

Mutual labels: question-answering, bert

Text2Text: Cross-lingual natural language processing and generation toolkit

Stars: ✭ 188 (+717.39%)

Mutual labels: question-answering, bert

KitanaQA: Adversarial training and data augmentation for neural question-answering models

Stars: ✭ 58 (+152.17%)

Mutual labels: question-answering, bert

Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.

Stars: ✭ 229 (+895.65%)

Mutual labels: question-answering, bert

Conversational Question Answering on Clinical Text

Stars: ✭ 22 (-4.35%)

Mutual labels: question-answering, bert

SQUAD2.Q-Augmented-Dataset

Augmented version of SQUAD 2.0 for Questions

Stars: ✭ 31 (+34.78%)

Mutual labels: question-answering, bert

⛔ [NOT MAINTAINED] A web interface for cdQA and other question answering systems.

Stars: ✭ 19 (-17.39%)

Mutual labels: question-answering, bert

query-focused-sum

Official code repository for "Exploring Neural Models for Query-Focused Summarization".

Stars: ✭ 17 (-26.09%)

Mutual labels: question-answering

Very Simple Question Answer System using Chinese Wikipedia Data

Stars: ✭ 24 (+4.35%)

Mutual labels: question-answering

Quality-Estimation2

机器翻译子任务-翻译质量评价-在BERT模型后面加上Bi-LSTM进行fine-tuning

Stars: ✭ 31 (+34.78%)

Mutual labels: bert

Extraction of the five journalistic W-questions (5W) from news articles

Stars: ✭ 16 (-30.43%)

Mutual labels: question-answering

Chinese-Psychological-QA-DataSet

中文心理问答数据集

Stars: ✭ 23 (+0%)

Mutual labels: question-answering

An Open-Source Package for Chinese Open-domain Conversational Chatbot (中文闲聊对话系统，一键部署微信闲聊机器人)

Stars: ✭ 94 (+308.7%)

Mutual labels: bert

View All Similar Projects ➔

mcQA : Multiple Choice Questions Answering

Answering multiple choice questions with Language Models.

News 📢

🚧 This project is currently under development. Stay tuned ! 🤩

Jun 6th, 2020

Refactored data subpackage, the library now supports RACE, Synonym, Swag and ARC data sets.
Upgrade to transformers==2.10.0.

Installation

With pip

pip install mcqa

From source

git clone https://github.com/mcqa-suite/mcqa.git
cd mcQA
pip install -e .

Getting started

Data preparation

To train a mcQA model, you need to create a csv file with n+2 columns, n being the number of choices for each question. The first column should be the context sentence, the n following columns should be the choices for that question and the last column is the selected answer.

Below is an example of a 3 choice question (taken from the CoS-E dataset) :

Context sentence	Choice 1	Choice 2	Choice 3	Label
People do what during their time off from work?	take trips	brow shorter	become hysterical	take trips

If you have a trained mcQA model and want to infer on a dataset, it should have the same format as the train data, but the label column.

See example data preparation below:

from mcqa.data import MCQAData

mcqa_data = MCQAData(bert_model="bert-base-uncased", lower_case=True, max_seq_length=256) 
                     
train_dataset = mcqa_data.read(data_file='swagaf/data/train.csv', is_training=True)
test_dataset = mcqa_data.read(data_file='swagaf/data/test.csv', is_training=False)

Model training

from mcqa.models import Model

mdl = Model(bert_model="bert-base-uncased", device="cuda") 
            
mdl.fit(train_dataset, train_batch_size=32, num_train_epochs=20)

Prediction

preds = mdl.predict(test_dataset, eval_batch_size=32)

Evaluation

from sklearn.metrics import accuracy_score
from mcqa.data import get_labels

print(accuracy_score(preds, get_labels(train_dataset)))

References

Type	Title	Author	Year
📰 Paper	Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understanding Datasets	Mor Geva, Yoav Goldberg, Jonathan Berant	2019
📰 Paper	Explain Yourself! Leveraging Language Models for Commonsense Reasoning	Nazneen Fatema Rajani, Bryan McCann, Caiming Xiong and Richard Socher	2019
📰 Paper	SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference	Rowan Zellers, Yonatan Bisk, Roy Schwartz and Yejin Choi	2018
📰 Paper	Can a Suit of Armor Conduct Electricity? A New Dataset for Open Book Question Answering	Todor Mihaylov, Peter Clark, Tushar Khot, Ashish Sabharwal	2018
📰 Paper	CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge	Alon Talmor, Jonathan Herzig, Nicholas Lourie, Jonathan Berant	2018
📰 Paper	RACE: Large-scale ReAding Comprehension Dataset From Examinations	Guokun Lai, Qizhe Xie, Hanxiao Liu, Yiming Yang and Eduard Hovy	2017
💻 Framework	Scikit-learn: Machine Learning in Python	Pedregosa et al.	2011
💻 Framework	PyTorch	Adam Paszke, Sam Gross, Soumith Chintala, Gregory Chanan	2016
💻 Framework	Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.	Hugging Face	2018
📹 Video	Stanford CS224N: NLP with Deep Learning Lecture 10 – Question Answering	Christopher Manning	2019

LICENSE

Contributing

Read our Contributing Guidelines.

Citation

@misc{Taycir2019,
  author = {mcQA-suite},
  title = {mcQA},
  year = {2019},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/mcQA-suite/mcQA/}}
}

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 23

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (16) 🔗