Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → beader → Ruijin_round2

beader / Ruijin_round2

瑞金医院MMC人工智能辅助构建知识图谱大赛复赛

Labels

jupyter-notebook nlp relation-extraction

Projects that are alternatives of or similar to Ruijin round2

Pytorch graph Rel

A PyTorch implementation of GraphRel

Stars: ✭ 204 (+28.3%)

Mutual labels: jupyter-notebook, relation-extraction

Relation Classification Using Bidirectional Lstm Tree

TensorFlow Implementation of the paper "End-to-End Relation Extraction using LSTMs on Sequences and Tree Structures" and "Classifying Relations via Long Short Term Memory Networks along Shortest Dependency Paths" for classifying relations

Stars: ✭ 167 (+5.03%)

Mutual labels: jupyter-notebook, relation-extraction

基于深度学习的开源中文关系抽取框架

Stars: ✭ 525 (+230.19%)

Mutual labels: jupyter-notebook, relation-extraction

Convolutional Neural Network for Multi-label Multi-instance Relation Extraction in Tensorflow

Stars: ✭ 190 (+19.5%)

Mutual labels: jupyter-notebook, relation-extraction

论文实现(ACL2019)：《Matching the Blanks: Distributional Similarity for Relation Learning》

Stars: ✭ 146 (-8.18%)

Mutual labels: jupyter-notebook, relation-extraction

Patch Based Texture Synthesis

Based on "Image Quilting for Texture Synthesis and Transfer" and "Real-Time Texture Synthesis by Patch-Based Sampling" papers

Stars: ✭ 159 (+0%)

Mutual labels: jupyter-notebook

A sequence of Jupyter notebooks featuring the "12 Steps to Navier-Stokes" http://lorenabarba.com/

Stars: ✭ 2,180 (+1271.07%)

Mutual labels: jupyter-notebook

Gpt2 Bert Reddit Bot

a bot that generates realistic replies using a combination of pretrained GPT-2 and BERT models

Stars: ✭ 158 (-0.63%)

Mutual labels: jupyter-notebook

Deploy and scale serverless machine learning app - in 4 steps.

Stars: ✭ 157 (-1.26%)

Mutual labels: jupyter-notebook

资产配置方案

Stars: ✭ 158 (-0.63%)

Mutual labels: jupyter-notebook

Lecture Python.notebooks

Notebooks for https://python.quantecon.org

Stars: ✭ 159 (+0%)

Mutual labels: jupyter-notebook

House Price Prediction

Predicting house prices using Linear Regression and GBR

Stars: ✭ 158 (-0.63%)

Mutual labels: jupyter-notebook

HandySpark - bringing pandas-like capabilities to Spark dataframes

Stars: ✭ 158 (-0.63%)

Mutual labels: jupyter-notebook

HDDM is a python module that implements Hierarchical Bayesian parameter estimation of Drift Diffusion Models (via PyMC).

Stars: ✭ 158 (-0.63%)

Mutual labels: jupyter-notebook

Kaggle Environments

Stars: ✭ 158 (-0.63%)

Mutual labels: jupyter-notebook

Creates a learning-curve plot for Jupyter/Colab notebooks that is updated in real-time.

Stars: ✭ 159 (+0%)

Mutual labels: jupyter-notebook

Covid19 mobility

COVID-19 Mobility Data Aggregator. Scraper of Google, Apple, Waze and TomTom COVID-19 Mobility Reports🚶🚘🚉

Stars: ✭ 156 (-1.89%)

Mutual labels: jupyter-notebook

⭐️ PyTorch implement of Deeply Supervised Salient Object Detection with Short Connection

Stars: ✭ 158 (-0.63%)

Mutual labels: jupyter-notebook

Train transformer language models with reinforcement learning.

Stars: ✭ 158 (-0.63%)

Mutual labels: jupyter-notebook

Tensorflow Dataset Tutorial

Notebook for my medium article about how to use Dataset API in TensorFlow

Stars: ✭ 158 (-0.63%)

Mutual labels: jupyter-notebook

View All Similar Projects ➔

瑞金医院MMC人工智能辅助构建知识图谱大赛复赛

⚠️ 由于可能存在的版权问题，请自行联系大赛主办方索要数据，在 Issues 中索要数据的请求将不再回复，谢谢!

背景

复赛题目是在 Named Entity 给定的基础上，做 Relation 抽取。

初赛代码见 beader/ruijin_round1

实体关系类别名称:

From Entity Type	To Entity Type	Relation Type
检查方法	疾病	Test_Disease
临床表现	疾病	Symptom_Disease
非药治疗	疾病	Treatment_Disease
药品名称	疾病	Drug_Disease
部位	疾病	Anatomy_Disease
用药频率	药品名称	Frequency_Drug
持续时间	药品名称	Duration_Drug
用药剂量	药品名称	Amount_Drug
用药方法	药品名称	Method_Drug
不良反应	药品名称	SideEff-Drug

数据样例

0.txt

中国成人2型糖尿病HBA1C  c控制目标的专家共识
目前,2型糖尿病及其并发症已经成为危害公众
健康的主要疾病之一,控制血糖是延缓糖尿病进展及
其并发症发生的重要措施之一。虽然HBA1C  。是评价血
糖控制水平的公认指标,但应该控制的理想水平即目
标值究竟是多少还存在争议。糖尿病控制与并发症试
验(DCCT,1993)、熊本(Kumamoto,1995)、英国前瞻性
糖尿病研究(UKPDS,1998)等高质量临床研究已经证
实,对新诊断的糖尿病患者或病情较轻的患者进行严
格的血糖控制会延缓糖尿病微血管病变的发生、发展,

0.ann

T1	Disease 1845 1850	1型糖尿病
T2	Disease 1983 1988	1型糖尿病
T4	Disease 30 35	2型糖尿病
T5	Disease 1822 1827	2型糖尿病
...
R206	Symptom_Disease Arg1:T329 Arg2:T325
R207	Symptom_Disease Arg1:T331 Arg2:T325
R208	Test_Disease Arg1:T337 Arg2:T338
R209	Treatment_Disease Arg1:T343 Arg2:T345
R210	Treatment_Disease Arg1:T344 Arg2:T345

数据使用 brat 进行标注，每个 .txt 文件对应一个 .ann 标注文件。

模型

构建训练样本

之前没有做 Relation Extraction 的经验，最直觉的想法是当成一个二分类问题来做。先生成 Candidate Entity Pairs，做一些简单的过滤，然后利用训练集中的 Relation 数据给 Candidate Entity Pairs 打 0 或者 1 的标签。

比赛中，用中文句号 (。) 做句子切分，选取 size=2, step=1 的滑动窗口来生成句子。即每个句子包含原始文章中的2句话。接着把每个句子中出现的 entities 做个排列组合，把不存在于比赛要求的 10 个 relation type 中的组合过滤掉，作为 candidate entity pairs。

向量化

对每个样本进行向量化，提取 5 个向量作为模型的输入。

char id sequence 为转化为字符id后的句子文本序列
entity labels vector 为代表 entity 类别的向量
from entity mask 用 [1] 标记出 from_entity 的位置，剩余位置补 [0]
to entity mask 用 [1] 标记出 to_entity 的位置，剩余位置补 [0]
entity distance 为一个带符号的实数，用来表示两个 entity 的距离

神经网络结构

效果评估

复赛采用 F1-Score 来衡量模型效果。最终这个 baseline model 线上的成绩为 0.733

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 159

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (0) 🔗