hitvoice / DrQA

A PyTorch implementation of Reading Wikipedia to Answer Open-Domain Questions.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives to or similar to DrQA

X3daudio1 7 hrtf
HRTF for Arma 3, Skyrim, and other titles that use XAudio2 + X3DAudio
Stars: ✭ 192 (-49.21%)
Mutual labels:  squad
extractive rc by runtime mt
Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"
Stars: ✭ 36 (-90.48%)
Mutual labels:  squad
FastFusionNet
A PyTorch Implementation of FastFusionNet on SQuAD 1.1
Stars: ✭ 38 (-89.95%)
Mutual labels:  squad
Dawn Bench Entries
DAWNBench: An End-to-End Deep Learning Benchmark and Competition
Stars: ✭ 254 (-32.8%)
Mutual labels:  squad
Question-Answering-based-on-SQuAD
Question Answering System using BiDAF Model on SQuAD v2.0
Stars: ✭ 20 (-94.71%)
Mutual labels:  squad
question-answering
No description or website provided.
Stars: ✭ 32 (-91.53%)
Mutual labels:  squad
Albert Tf2.0
ALBERT model Pretraining and Fine Tuning using TF2.0
Stars: ✭ 180 (-52.38%)
Mutual labels:  squad
Learning to retrieve reasoning paths
The official implementation of ICLR 2020, "Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering".
Stars: ✭ 318 (-15.87%)
Mutual labels:  squad
Squad
Dockerfile for automated build of a Squad gameserver: https://hub.docker.com/r/cm2network/squad/
Stars: ✭ 21 (-94.44%)
Mutual labels:  squad
Medi-CoQA
Conversational Question Answering on Clinical Text
Stars: ✭ 22 (-94.18%)
Mutual labels:  squad
QANet-pytorch-
A Pytorch implementation of QANet
Stars: ✭ 91 (-75.93%)
Mutual labels:  squad
Transformer-QG-on-SQuAD
Implement Question Generator with SOTA pre-trained Language Models (RoBERTa, BERT, GPT, BART, T5, etc.)
Stars: ✭ 28 (-92.59%)
Mutual labels:  squad
PersianQA
Persian (Farsi) Question Answering Dataset (+ Models)
Stars: ✭ 114 (-69.84%)
Mutual labels:  squad
R Net
R-net in PyTorch, with ELMo
Stars: ✭ 194 (-48.68%)
Mutual labels:  squad
qa
TensorFlow Models for the Stanford Question Answering Dataset
Stars: ✭ 72 (-80.95%)
Mutual labels:  squad
R Net In Keras
Open R-NET implementation and detailed analysis: https://git.io/vd8dx
Stars: ✭ 181 (-52.12%)
Mutual labels:  squad
MRC Competition Dureader
机器阅读理解 冠军/亚军代码及中文预训练MRC模型
Stars: ✭ 552 (+46.03%)
Mutual labels:  squad
R Net
A Tensorflow Implementation of R-net: Machine reading comprehension with self matching networks
Stars: ✭ 321 (-15.08%)
Mutual labels:  squad
co-attention
Pytorch implementation of "Dynamic Coattention Networks For Question Answering"
Stars: ✭ 54 (-85.71%)
Mutual labels:  squad
SQUAD2.Q-Augmented-Dataset
Augmented version of SQUAD 2.0 for Questions
Stars: ✭ 31 (-91.8%)
Mutual labels:  squad

DrQA

A PyTorch implementation of the ACL 2017 paper Reading Wikipedia to Answer Open-Domain Questions (DrQA).

Reading comprehension is the task of producing an answer given a question and one or more pieces of evidence (usually natural-language paragraphs). Compared to question answering over knowledge bases, reading comprehension models are more flexible and have shown great potential for zero-shot learning.

SQuAD is a reading comprehension benchmark in which each question comes with a single piece of evidence and the answer is guaranteed to be a span of that evidence. Since the publication of the SQuAD dataset, progress in reading comprehension research has been rapid and many strong models have appeared. DrQA is conceptually simpler than most of them but still yields strong performance even as a single model.
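
For reference, SQuAD v1.1 ships as a single JSON file that nests articles, paragraphs and question-answer pairs. A minimal sketch of that layout follows; the file path is an assumption (adjust it to wherever download.sh put your copy).

import json

# Peek at the SQuAD v1.1 layout: data -> paragraphs -> qas -> answers.
# The path below is an assumption, not taken from this repo's scripts.
with open("SQuAD/train-v1.1.json") as f:
    squad = json.load(f)

article = squad["data"][0]            # one Wikipedia article
paragraph = article["paragraphs"][0]  # one evidence paragraph
qa = paragraph["qas"][0]              # one question about that paragraph

print("Context :", paragraph["context"][:100], "...")
print("Question:", qa["question"])
# Each answer records its text plus the character offset where it starts in the context.
print("Answer  :", qa["answers"][0]["text"], "(char offset", qa["answers"][0]["answer_start"], ")")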

The motivation for this project is to offer a clean implementation of DrQA for machine reading comprehension, so that one can quickly make modifications and try out new ideas. See the Detailed Comparisons section below for how it differs from what's described in the original paper and from the two "official" projects, ParlAI and facebookresearch/DrQA.

Requirements

Quick Start

Setup

  • download the project via git clone https://github.com/hitvoice/DrQA.git; cd DrQA
  • make sure python 3, pip, wget and unzip are installed.
  • install PyTorch matching your OS, Python and CUDA versions.
  • install the remaining requirements via pip install -r requirements.txt
  • download the SQuAD data files, GloVe word vectors and the spaCy English language model using bash download.sh (a quick sanity check of the finished setup is sketched below).
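
Once everything is installed, a short check like the following sketch confirms that PyTorch, CUDA and spaCy are usable. The model name en_core_web_sm is an assumption; substitute whichever English model your setup installed.

# Environment sanity check (a sketch, not part of the repo).
import torch
import spacy

print("PyTorch:", torch.__version__)
print("CUDA available:", torch.cuda.is_available())

# The model name is an assumption; use the English model you actually installed.
nlp = spacy.load("en_core_web_sm")
doc = nlp("DrQA reads Wikipedia to answer open-domain questions.")
print("spaCy tokens:", [t.text for t in doc])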

Train

# prepare the data
python prepro.py
# train for 40 epochs with batch size 32
python train.py -e 40 -bs 32

Warning: running prepro.py takes about 9 GB of memory when using 8 threads. If there isn't enough memory on your machine, try reducing the number of threads the script uses, for example python prepro.py --threads 2
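
The memory footprint grows roughly with the number of workers because each worker holds its own tokenizer and a slice of the corpus. The sketch below shows that generic multi-worker preprocessing pattern; the function names and the spaCy model name are illustrative assumptions, not the actual code in prepro.py.

# Illustrative multi-process preprocessing sketch (hypothetical names, not prepro.py).
from multiprocessing import Pool
import spacy

def tokenize_chunk(texts):
    # Each worker loads its own spaCy pipeline, which is why memory grows with the worker count.
    nlp = spacy.load("en_core_web_sm", disable=["parser"])  # model name is an assumption
    return [[tok.text for tok in nlp(t)] for t in texts]

def preprocess(all_texts, n_workers=2):
    chunks = [all_texts[i::n_workers] for i in range(n_workers)]
    with Pool(n_workers) as pool:
        return pool.map(tokenize_chunk, chunks)

if __name__ == "__main__":
    docs = ["Super Bowl 50 was an American football game.", "It was played in 2016."]
    print(preprocess(docs, n_workers=2))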

Predict

python interact.py

Example interactions:

Evidence: Super Bowl 50 was an American football game to determine the champion of the National Football League (NFL) for the 2015 season. The American Football Conference (AFC) champion Denver Broncos defeated the National Football Conference (NFC) champion Carolina Panthers 24-10 to earn their third Super Bowl title. The game was played on February 7, 2016, at Levi's Stadium in the San Francisco Bay Area at Santa Clara, California.
Question: What day was the game played on?
Answer: February 7, 2016
Time: 0.0245s

Evidence: Super Bowl 50 was an American football game to determine the champion of the National Football League (NFL) for the 2015 season. The American Football Conference (AFC) champion Denver Broncos defeated the National Football Conference (NFC) champion Carolina Panthers 24-10 to earn their third Super Bowl title. The game was played on February 7, 2016, at Levi's Stadium in the San Francisco Bay Area at Santa Clara, California.
Question: What is the AFC short for?
Answer: The American Football Conference
Time: 0.0214s

Evidence: Beanie style with simple design. So cool to wear and make you different. It wears as peak cap and a fashion cap. It is good to match your clothes well during party and holiday, also makes you charming and fashion, leisure and fashion in public and streets. It suits all adults, for men or women. Matches well with your winter outfits so you stay warm all winter long.
Question: Is it for women?
Answer: It suits all adults, for men or women
Time: 0.0238s

The last example is a randomly picked product description from Amazon (not in SQuAD).

Results

EM & F1

                        EM      F1
in the original paper   69.5    78.8
in this project         69.64   78.76
official (spaCy)        69.71   78.94
official (CoreNLP)      69.76   79.09
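
Both numbers follow the standard SQuAD evaluation: exact match (EM) checks whether the normalized prediction equals a gold answer, and F1 measures token-level overlap. The condensed sketch below is simplified from the official scoring logic (the real script also takes the maximum over multiple reference answers).

# Condensed sketch of the SQuAD metrics (simplified; single reference answer only).
import re
import string
from collections import Counter

def normalize(s):
    s = s.lower()
    s = "".join(ch for ch in s if ch not in set(string.punctuation))
    s = re.sub(r"\b(a|an|the)\b", " ", s)   # drop articles
    return " ".join(s.split())

def exact_match(prediction, gold):
    return float(normalize(prediction) == normalize(gold))

def f1(prediction, gold):
    pred_tokens, gold_tokens = normalize(prediction).split(), normalize(gold).split()
    common = Counter(pred_tokens) & Counter(gold_tokens)
    overlap = sum(common.values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)

print(exact_match("February 7, 2016", "February 7, 2016"))  # 1.0
print(round(f1("The American Football Conference", "American Football Conference (AFC)"), 2))  # 0.86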

Compared with the official implementation, the scores of this project are essentially the same.

Detailed Comparisons

Compared to what's described in the original paper:

  • The grammatical features are generated by spaCy instead of Stanford CoreNLP. It's much faster and produces similar scores (a sketch of the feature extraction is shown below).
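
These features are per-token annotations: lemma, POS tag and named-entity type. The sketch below shows how they can be pulled out of spaCy; the model name is an assumption, and prepro.py may organize this differently.

# Sketch of extracting DrQA's grammatical features (lemma, POS, NER) with spaCy.
import spacy

nlp = spacy.load("en_core_web_sm")  # model name is an assumption
doc = nlp("The Denver Broncos defeated the Carolina Panthers 24-10 on February 7, 2016.")
for tok in doc:
    print(f"{tok.text:12s} lemma={tok.lemma_:12s} tag={tok.tag_:5s} ent={tok.ent_type_ or '-'}")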

Compared to the code in facebookresearch/DrQA:

  • This project is much more lightweight and focuses solely on training and evaluating on the SQuAD dataset; it lacks the document retriever, the interactive inference API, and some other features.
  • The implementation in facebookresearch/DrQA can train on multiple GPUs, while (currently, for simplicity) this implementation only supports single-GPU training.

Compared to the code in facebookresearch/ParlAI:

  • The DrQA model is no longer wrapped in a chatbot framework, which makes the code more readable, easier to modify and faster to train. Preprocessing of the text corpus is performed only once, while in a dialog framework raw text is transmitted each time and the same text must be preprocessed over and over.
  • This is a full implementation of the original paper, while the model in ParlAI is a partial implementation, missing all grammatical features (lemma, POS tags and named entity tags).
  • Some minor bug fixes. Some of them have been merged into ParlAI.

About

Maintainer: Runqi Yang.

Credits: thanks to Jun Yang for code review and advice.

Most of the PyTorch model code is borrowed from Facebook/ParlAI under a BSD-3 license.
