Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → DFKI-NLP → Tre

DFKI-NLP / Tre

Licence: mit

[AKBC 19] Improving Relation Extraction by Pre-trained Language Representations

Programming Languages

python

139335 projects - #7 most used programming language

Labels

machine-learning nlp transformer relation-extraction information-extraction

Projects that are alternatives of or similar to Tre

Distre

[ACL 19] Fine-tuning Pre-Trained Transformer Language Models to Distantly Supervised Relation Extraction

Stars: ✭ 75 (-21.05%)

Mutual labels: relation-extraction, information-extraction, transformer

PSPE

Pretrained Span and span Pair Encoder, code for "Pre-training Entity Relation Encoder with Intra-span and Inter-spanInformation.", EMNLP2020. It is based on our NERE toolkit (https://github.com/Receiling/NERE).

Stars: ✭ 17 (-82.11%)

Mutual labels: information-extraction, relation-extraction

IE Paper Notes

Paper notes for Information Extraction, including Relation Extraction (RE), Named Entity Recognition (NER), Entity Linking (EL), Event Extraction (EE), Named Entity Disambiguation (NED).

Stars: ✭ 14 (-85.26%)

Mutual labels: information-extraction, relation-extraction

Tacred Relation

PyTorch implementation of the position-aware attention model for relation extraction

Stars: ✭ 271 (+185.26%)

Mutual labels: relation-extraction, information-extraction

CogIE

CogIE: An Information Extraction Toolkit for Bridging Text and CogNet. ACL 2021

Stars: ✭ 47 (-50.53%)

Mutual labels: information-extraction, relation-extraction

InformationExtractionSystem

Information Extraction System can perform NLP tasks like Named Entity Recognition, Sentence Simplification, Relation Extraction etc.

Stars: ✭ 27 (-71.58%)

Mutual labels: information-extraction, relation-extraction

Multiple Relations Extraction Only Look Once

Multiple-Relations-Extraction-Only-Look-Once. Just look at the sentence once and extract the multiple pairs of entities and their corresponding relations. 端到端联合多关系抽取模型，可用于 http://lic2019.ccf.org.cn/kg 信息抽取。

Stars: ✭ 269 (+183.16%)

Mutual labels: relation-extraction, information-extraction

Open Ie Papers

Open Information Extraction (OpenIE) and Open Relation Extraction (ORE) papers and data.

Stars: ✭ 150 (+57.89%)

Mutual labels: relation-extraction, information-extraction

Gcn Over Pruned Trees

Graph Convolution over Pruned Dependency Trees Improves Relation Extraction (authors' PyTorch implementation)

Stars: ✭ 312 (+228.42%)

Mutual labels: relation-extraction, information-extraction

Aggcn

Attention Guided Graph Convolutional Networks for Relation Extraction (authors' PyTorch implementation for the ACL19 paper)

Stars: ✭ 318 (+234.74%)

Mutual labels: relation-extraction, information-extraction

Casrel

A Novel Cascade Binary Tagging Framework for Relational Triple Extraction. Accepted by ACL 2020.

Stars: ✭ 329 (+246.32%)

Mutual labels: relation-extraction, information-extraction

DocuNet

Code and dataset for the IJCAI 2021 paper "Document-level Relation Extraction as Semantic Segmentation".

Stars: ✭ 84 (-11.58%)

Mutual labels: information-extraction, relation-extraction

ReQuest

Indirect Supervision for Relation Extraction Using Question-Answer Pairs (WSDM'18)

Stars: ✭ 26 (-72.63%)

Mutual labels: information-extraction, relation-extraction

Relation-Extraction-Transformer

NLP: Relation extraction with position-aware self-attention transformer

Stars: ✭ 63 (-33.68%)

Mutual labels: transformer, relation-extraction

lima

The Libre Multilingual Analyzer, a Natural Language Processing (NLP) C++ toolkit.

Stars: ✭ 75 (-21.05%)

Mutual labels: information-extraction, relation-extraction

knowledge-graph-nlp-in-action

从模型训练到部署，实战知识图谱(Knowledge Graph)&自然语言处理(NLP)。涉及 Tensorflow, Bert+Bi-LSTM+CRF,Neo4j等涵盖 Named Entity Recognition,Text Classify,Information Extraction,Relation Extraction 等任务。

Stars: ✭ 58 (-38.95%)

Mutual labels: information-extraction, relation-extraction

Pytorch multi head selection re

BERT + reproduce "Joint entity recognition and relation extraction as a multi-head selection problem" for Chinese and English IE

Stars: ✭ 105 (+10.53%)

Mutual labels: relation-extraction, information-extraction

Information Extraction Chinese

Chinese Named Entity Recognition with IDCNN/biLSTM+CRF, and Relation Extraction with biGRU+2ATT 中文实体识别与关系提取

Stars: ✭ 1,888 (+1887.37%)

Mutual labels: relation-extraction, information-extraction

Oie Resources

A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.

Stars: ✭ 283 (+197.89%)

Mutual labels: relation-extraction, information-extraction

Open Entity Relation Extraction

Knowledge triples extraction and knowledge base construction based on dependency syntax for open domain text.

Stars: ✭ 350 (+268.42%)

Mutual labels: relation-extraction, information-extraction

View All Similar Projects ➔

Improving Relation Extraction by Pre-trained Language Representations

This repository contains the code of our paper:
Improving Relation Extraction by Pre-trained Language Representations.
Christoph Alt*, Marc Hübner*, Leonhard Hennig

We fine-tune the pre-trained OpenAI GPT [1] to the task of relation extraction and show that it achieves state-of-the-art results on SemEval 2010 Task 8 and TACRED relation extraction datasets.

Our code depends on huggingface's PyTorch reimplementation of the OpenAI GPT [2] - so thanks to them.

Installation

First, clone the repository to your machine and install the requirements with the following command:

pip install -r requirements.txt

We also need the weights of the pre-trained Transformer, which can be downloaded with the following command:

./download-model.sh

The english spacy model is required for sentence segmentation:

python -m spacy download en

Prepare the data

We evaluate our model on SemEval 2010 Task 8 and TACRED, which is available through LDC.

Our model expects the input dataset to be in JSONL format. To convert a dataset run the following command:

python dataset_converter.py <DATASET DIR> <CONVERTED DATASET DIR> --dataset=<DATASET NAME>

Training

E.g. for training on the TACRED dataset, run the following command:

CUDA_VISIBLE_DEVICES=0 python relation_extraction.py train \
  --write-model True \
  --masking-mode grammar_and_ner \
  --batch-size 8 \
  --max-epochs 3 \
  --lm-coef 0.5 \
  --learning-rate 5.25e-5 \
  --learning-rate-warmup 0.002 \
  --clf-pdrop 0.1 \
  --attn-pdrop 0.1 \
  --word-pdrop 0.0 \
  --dataset tacred \
  --data-dir <CONVERTED DATASET DIR> \
  --seed=0 \
  --log-dir ./logs/

Evaluation

CUDA_VISIBLE_DEVICES=0 python relation_extraction.py evaluate \
  --dataset tacred \
  --masking_mode grammar_and_ner \
  --test_file ./data/tacred/test.jsonl \
  --save_dir ./logs/ \
  --model_file <MODEL FILE (e.g. model_epoch...)> \
  --batch_size 8 \
  --log_dir ./logs/

Trained Models

The models we trained on SemEval and TACRED to produce our paper results can be found here:

Dataset	Masking Mode	P	R	F1	Download
TACRED	grammar_and_ner	70.0	65.0	67.4	Link
SemEval	None	87.6	86.8	87.1	Link

Download and extract model files

First, download the archive corresponding to the model you want to evaluate (links in the table above).

wget --content-disposition <DOWNLOAD URL>

Extract the model archive containing model.pt, text_encoder.pkl, and label_encoder.pkl.

tar -xvzf <MODEL ARCHIVE>

Run evaluation

dataset: dataset to evaluate, can be one of "semeval" or "tacred".
test-file: path to the JSONL test file used during evaluation
log-dir: directory to store the evaluation results and predictions
save-dir: directory containing the downloaded model files (model.pt, text_encoder.pkl, and label_encoder.pkl)
masking-mode: masking mode to use during evaluation, can be one of "None", "grammar_and_ner", "grammar", "ner" or "unk" (caution: must match the mode for training)

For example, to evaluate the TACRED model with "grammar_and_ner" masking, run the following command:

CUDA_VISIBLE_DEVICES=0 python relation_extraction.py evaluate \
      --dataset tacred \
      --test-file ./<CONVERTED DATASET DIR>/test.jsonl \
      --log-dir <RESULTS DIR> \
      --save-dir <MODEL DIR> \
      --masking_mode grammar_and_ner

Citations

If you use our code in your research or find our repository useful, please consider citing our work.

@InProceedings{alt_improving_2019,
  author = {Alt, Christoph and H\"{u}bner, Marc and Hennig, Leonhard},
  title = {Improving Relation Extraction by Pre-trained Language Representations},
  booktitle = {Proceedings of AKBC 2019},
  year = {2019},
  url = {https://openreview.net/forum?id=BJgrxbqp67},
}

License

lm-transformer-re is released under the MIT license. See LICENSE for additional details.

References

Improving language understanding by generative pre-training. Alec Radford, Karthik Narasimhan, Tim Salimans, and Ilya Sutskever.
PyTorch implementation of OpenAI's Finetuned Transformer Language Model

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 95

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (3) 🔗