thunlp / Fewrel

License: MIT
A Large-Scale Few-Shot Relation Extraction Dataset

Programming Languages

python

Projects that are alternatives to or similar to Fewrel

Rex
REx: Relation Extraction. Modernized rewrite of the code in the master's thesis: "Relation Extraction using Distant Supervision, SVMs, and Probabilistic First-Order Logic"
Stars: ✭ 21 (-96.01%)
Mutual labels:  natural-language-processing, relation-extraction
Deeplearning nlp
A natural language processing library based on deep learning
Stars: ✭ 154 (-70.72%)
Mutual labels:  natural-language-processing, relation-extraction
Knowledge Graphs
A collection of research on knowledge graphs
Stars: ✭ 845 (+60.65%)
Mutual labels:  natural-language-processing, relation-extraction
Awesome Relation Extraction
📖 A curated list of awesome resources dedicated to Relation Extraction, one of the most important tasks in Natural Language Processing (NLP).
Stars: ✭ 656 (+24.71%)
Mutual labels:  natural-language-processing, relation-extraction
Languagecrunch
LanguageCrunch NLP server docker image
Stars: ✭ 281 (-46.58%)
Mutual labels:  natural-language-processing, relation-extraction
Pytorch graph Rel
A PyTorch implementation of GraphRel
Stars: ✭ 204 (-61.22%)
Mutual labels:  natural-language-processing, relation-extraction
Exemplar
An open relation extraction system
Stars: ✭ 46 (-91.25%)
Mutual labels:  natural-language-processing, relation-extraction
Gcn Over Pruned Trees
Graph Convolution over Pruned Dependency Trees Improves Relation Extraction (authors' PyTorch implementation)
Stars: ✭ 312 (-40.68%)
Mutual labels:  natural-language-processing, relation-extraction
Tacred Relation
PyTorch implementation of the position-aware attention model for relation extraction
Stars: ✭ 271 (-48.48%)
Mutual labels:  natural-language-processing, relation-extraction
Reside
EMNLP 2018: RESIDE: Improving Distantly-Supervised Neural Relation Extraction using Side Information
Stars: ✭ 222 (-57.79%)
Mutual labels:  natural-language-processing, relation-extraction
Oie Resources
A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Stars: ✭ 283 (-46.2%)
Mutual labels:  natural-language-processing, relation-extraction
Usc Ds Relationextraction
Distantly Supervised Relation Extraction
Stars: ✭ 378 (-28.14%)
Mutual labels:  natural-language-processing, relation-extraction
Rnnlg
RNNLG is an open source benchmark toolkit for Natural Language Generation (NLG) in spoken dialogue system application domains. It is released by Tsung-Hsien (Shawn) Wen from Cambridge Dialogue Systems Group under Apache License 2.0.
Stars: ✭ 487 (-7.41%)
Mutual labels:  natural-language-processing
Spacy Stanza
💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy
Stars: ✭ 508 (-3.42%)
Mutual labels:  natural-language-processing
Ml Mipt
Open Machine Learning course at MIPT
Stars: ✭ 480 (-8.75%)
Mutual labels:  natural-language-processing
Learn Data Science For Free
This repository is a combination of different resources lying scattered all over the internet. The reason for making such a repository is to combine all the valuable resources in a sequential manner, so that it helps every beginner who is in search of a free and structured learning resource for Data Science. For constant updates follow me in …
Stars: ✭ 4,757 (+804.37%)
Mutual labels:  natural-language-processing
Nlp Notebooks
A collection of notebooks for Natural Language Processing from NLP Town
Stars: ✭ 513 (-2.47%)
Mutual labels:  natural-language-processing
Seqgan
A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)
Stars: ✭ 502 (-4.56%)
Mutual labels:  natural-language-processing
Stealth
An open source Ruby framework for text and voice chatbots. 🤖
Stars: ✭ 481 (-8.56%)
Mutual labels:  natural-language-processing
Textgan Pytorch
TextGAN is a PyTorch framework for Generative Adversarial Networks (GANs) based text generation models.
Stars: ✭ 479 (-8.94%)
Mutual labels:  natural-language-processing

FewRel Dataset, Toolkits and Baseline Models

Our benchmark website: https://thunlp.github.io/fewrel.html

FewRel is a large-scale few-shot relation extraction dataset containing more than one hundred relations and tens of thousands of annotated instances across different domains. The dataset is presented in our EMNLP 2018 paper FewRel: A Large-Scale Supervised Few-Shot Relation Classification Dataset with State-of-the-Art Evaluation, and a follow-up version is presented in our EMNLP 2019 paper FewRel 2.0: Towards More Challenging Few-Shot Relation Classification.

Based on our dataset and the designed few-shot settings, we provide two different benchmarks:

  • FewRel 1.0: This is the first benchmark to combine few-shot learning with relation extraction: your model needs to handle both the few-shot challenge and extracting entity relations from plain text (for a concrete picture of an N-way K-shot episode, see the sketch after this list). The benchmark provides a training set with 64 relations and a validation set with 16 relations. Once you submit your code to our benchmark website, it is evaluated on a hidden test set with 20 relations. Each relation has 100 human-annotated instances.

  • FewRel 2.0: We found that two aspects are long neglected in previous few-shot research: (1) how well models can transfer across different domains, and (2) whether few-shot models can detect instances belonging to none of the given few-shot classes. To dig deeper into these two aspects, we propose version 2.0 of our dataset, with newly added domain adaptation (DA) and none-of-the-above (NOTA) detection challenges. Find out more in our paper and on the evaluation websites: FewRel 2.0 domain adaptation / FewRel 2.0 none-of-the-above detection.
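
To make the N-way K-shot setting concrete, below is a minimal Python sketch of episode sampling. It is not the repo's actual data loader; it only assumes that data/train_wiki.json maps each relation ID to a list of annotated instances (each holding the tokens plus head/tail entity annotations), which matches the released format.

import json
import random

# Minimal episode-sampling sketch (not the repo's actual sampler).
def sample_episode(data, n_way=5, k_shot=5, q_query=1):
    relations = random.sample(sorted(data), n_way)  # pick N relations
    support, query = [], []
    for label, rel in enumerate(relations):
        instances = random.sample(data[rel], k_shot + q_query)
        support += [(ins, label) for ins in instances[:k_shot]]
        query += [(ins, label) for ins in instances[k_shot:]]
    # the model must classify each query instance among the N relations,
    # given only the K support instances per relation
    return support, query

with open("data/train_wiki.json") as f:
    train = json.load(f)
support, query = sample_episode(train, n_way=5, k_shot=5, q_query=1)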

Citing

If you use our data, toolkits, or baseline models, please cite our papers:

@inproceedings{han-etal-2018-fewrel,
    title = "{F}ew{R}el: A Large-Scale Supervised Few-Shot Relation Classification Dataset with State-of-the-Art Evaluation",
    author = "Han, Xu and Zhu, Hao and Yu, Pengfei and Wang, Ziyun and Yao, Yuan and Liu, Zhiyuan and Sun, Maosong",
    booktitle = "Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing",
    month = oct # "-" # nov,
    year = "2018",
    address = "Brussels, Belgium",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/D18-1514",
    doi = "10.18653/v1/D18-1514",
    pages = "4803--4809"
}

@inproceedings{gao-etal-2019-fewrel,
    title = "{F}ew{R}el 2.0: Towards More Challenging Few-Shot Relation Classification",
    author = "Gao, Tianyu and Han, Xu and Zhu, Hao and Liu, Zhiyuan and Li, Peng and Sun, Maosong and Zhou, Jie",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)",
    month = nov,
    year = "2019",
    address = "Hong Kong, China",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/D19-1649",
    doi = "10.18653/v1/D19-1649",
    pages = "6251--6256"
}

If you have questions about any part of the paper, submission, leaderboard, code, or data, please e-mail [email protected].

Contributions

For FewRel 1.0, Hao Zhu first proposed the problem and the approach for building the dataset and the baseline system; Ziyuan Wang built and maintained the crowdsourcing website; Yuan Yao helped download the original data and performed preprocessing; Xu Han, Hao Zhu, Pengfei Yu and Ziyun Wang implemented the baselines and wrote the paper together; Zhiyuan Liu provided thoughtful advice and funding throughout the project. The order of the first four authors was determined by dice rolling.

Dataset and Pretrain files

The dataset is already included in this GitHub repo. However, due to their large size, the GloVe files (pre-trained word embeddings) and the BERT pre-trained checkpoint are not. Please download pretrain.tar from Tsinghua Cloud, put it under the root directory, and run tar xvf pretrain.tar to decompress it.

We also provide pid2name.json, which gives the Wikidata PID, name, and description for each relation.
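
As a minimal sketch for looking up a relation (assuming each entry maps a PID to a [name, description] pair, and using P931 purely as an illustrative key):

import json

# Assumed format: {"P931": ["place served by transport hub", "..."], ...}
with open("pid2name.json") as f:
    pid2name = json.load(f)

name, description = pid2name["P931"]  # P931 is just an example PID
print(f"P931: {name} -- {description}")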

Note: we have not released the test sets for FewRel 1.0 or 2.0, to keep the comparison fair. We recommend evaluating your models on the validation set first and then submitting them to our evaluation websites (linked above).

Training a Model

To run our baseline models, use the command

python train_demo.py

This will start the training and evaluation process for Prototypical Networks in a 5-way 5-shot setting. You can also use different args to start different processes; some of them are listed here, and an example command combining several of these flags follows the list:

  • train / val / test: Specify the training / validation / test set. For example, if you pass train_wiki for train, the program loads data/train_wiki.json for training. Always use train_wiki for training, and val_wiki (FewRel 1.0 and the FewRel 2.0 NOTA challenge) or val_pubmed (the FewRel 2.0 DA challenge) for validation.
  • trainN: The N in N-way K-shot used during training.
  • N: The N in N-way K-shot.
  • K: The K in N-way K-shot.
  • Q: Sample Q query instances for each relation.
  • model: Which model to use. The default is proto, standing for Prototypical Networks. Note that to use the PAIR model from our FewRel 2.0 paper, you should also pass --encoder bert --pair.
  • encoder: Which encoder to use: cnn or bert.
  • na_rate: The NA rate for FewRel 2.0 none-of-the-above (NOTA) detection. Here na_rate specifies the ratio between Q for NOTA and Q for the positive classes. For example, na_rate=0 is the normal setting, while na_rate=1, 2 and 5 correspond to NA rates of 15%, 30% and 50% in 5-way settings.

There are also many other training args (such as batch_size and lr); you can find more details in our code.
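
For instance, the following command (only an illustration, built from the flags documented above) trains Prototypical Networks with a CNN encoder in a 10-way 5-shot setting:

python train_demo.py \
    --trainN 10 --N 10 --K 5 --Q 5 \
    --model proto --encoder cnn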

Inference

You can evaluate an existing checkpoint with

python train_demo.py --only_test --load_ckpt {CHECKPOINT_PATH} {OTHER_ARGS}

Here we provide a BERT-PAIR checkpoint (trained on the FewRel 1.0 dataset, 5-way 1-shot).
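
For example, evaluating that 5-way 1-shot BERT-PAIR checkpoint might look like the command below; {CHECKPOINT_PATH} remains a placeholder for wherever you saved the file, and the model flags mirror the training command for that model:

python train_demo.py \
    --only_test --load_ckpt {CHECKPOINT_PATH} \
    --model pair --encoder bert --pair --hidden_size 768 \
    --N 5 --K 1 --Q 1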

Reproduction

BERT-PAIR for FewRel 1.0

python train_demo.py \
    --trainN 5 --N 5 --K 1 --Q 1 \
    --model pair --encoder bert --pair --hidden_size 768 --val_step 1000 \
    --batch_size 4 --fp16

Note that --fp16 requires NVIDIA's apex.
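
If you do not have apex installed, a commonly used recipe is shown below; treat the apex README as the authoritative source, since the recommended install steps change over time.

git clone https://github.com/NVIDIA/apex
cd apex
pip install -v --no-cache-dir ./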

        5-way 1-shot   5-way 5-shot   10-way 1-shot   10-way 5-shot
Val     85.66          89.48          76.84           81.76
Test    88.32          93.22          80.63           87.02

BERT-PAIR for Domain Adaptation (FewRel 2.0)

python train_demo.py \
    --trainN 5 --N 5 --K 1 --Q 1 \
    --model pair --encoder bert --pair --hidden_size 768 --val_step 1000 \
    --batch_size 4 --fp16 --val val_pubmed --test val_pubmed

        5-way 1-shot   5-way 5-shot   10-way 1-shot   10-way 5-shot
Val     70.70          80.59          59.52           70.30
Test    67.41          78.57          54.89           66.85

BERT-PAIR for None-of-the-Above (FewRel 2.0)

python train_demo.py \
    --trainN 5 --N 5 --K 1 --Q 1 \
    --model pair --encoder bert --pair --hidden_size 768 --val_step 1000 \
    --batch_size 4 --fp16 --na_rate 5

        5-way 1-shot (0% NOTA)   5-way 1-shot (50% NOTA)   5-way 5-shot (0% NOTA)   5-way 5-shot (50% NOTA)
Val     74.56                    73.09                     75.01                    75.38
Test    76.73                    80.31                     83.32                    84.64

Proto-CNN + Adversarial Training for Domain Adaptation (FewRel 2.0)

python train_demo.py \
    --val val_pubmed --adv pubmed_unsupervised --trainN 10 --N {} --K {} \
    --model proto --encoder cnn --val_step 1000

        5-way 1-shot   5-way 5-shot   10-way 1-shot   10-way 5-shot
Val     48.73          64.38          34.82           50.39
Test    42.21          58.71          28.91           44.35