ZihengZZH / industry-eval-EA

Licence: MIT license

An Industry Evaluation of Embedding-based Entity Alignment @ COLING'20

Labels

knowledge-graph entity-alignment knowledge-graph-alignment biased-seed-mappings industry-benchmark

industry-eval-EA

The code and benchmark for the paper An Industry Evaluation of Embedding-based Entity Alignment [arxiv] [coling], published in the Proceedings of COLING 2020.

Code

We provide the source code for generating biased seed mappings for entity alignment (EA).

code
|__ check_benchmark.py
|__ sample_benchmark.py
|__ config.json

Specifically, we provide four settings for extracting biased seed mappings:

baseline [without any bias]
- "Ideal" in 4.2 of our paper
- "With No Bias" in 4.3 of our paper
name-biased [same name]
- "With Name Bias" in 4.3 of our paper
attr-biased [more attributes]
- "With Attribute Bias" in 4.3 of our paper
industry [same name & more attributes]
- "Industrial" in 4.2 of our paper

All four settings follow the algorithm introduced in Section 3.2 of our paper.
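As a rough illustration of how the four settings differ, the sketch below treats each setting as a filter over the ground-truth entity links and draws the train/val splits only from the links that pass the filter. It is not the algorithm from the paper or the logic of sample_benchmark.py: the function name, the input dictionaries, the exact-match definition of "same name", and the `min_attrs` threshold for "more attributes" are all assumptions made for illustration.

```python
import random


def sample_biased_splits(links, names1, names2, attrs1, attrs2, setting,
                         train_ratio=0.02, val_ratio=0.01,
                         min_attrs=3, seed=0):
    """Split entity links into train/val/test, drawing train/val from a biased pool.

    links:   list of (entity_in_KG1, entity_in_KG2) ground-truth pairs
    names1/names2: dict entity -> name, one per KG (hypothetical inputs)
    attrs1/attrs2: dict entity -> number of attribute triples (hypothetical inputs)
    min_attrs: assumed cut-off for "more attributes", not taken from the paper
    """
    def same_name(pair):
        e1, e2 = pair
        return names1[e1] == names2[e2]

    def many_attrs(pair):
        e1, e2 = pair
        return attrs1.get(e1, 0) >= min_attrs and attrs2.get(e2, 0) >= min_attrs

    # One predicate per setting named in this README.
    predicates = {
        "baseline": lambda pair: True,
        "name-biased": same_name,
        "attr-biased": many_attrs,
        "industry": lambda pair: same_name(pair) and many_attrs(pair),
    }
    keep = predicates[setting]
    pool = [pair for pair in links if keep(pair)]

    n_train = round(len(links) * train_ratio)
    n_val = round(len(links) * val_ratio)
    if len(pool) < n_train + n_val:
        raise ValueError("not enough biased links for the requested ratios")

    rng = random.Random(seed)
    chosen = rng.sample(pool, n_train + n_val)
    train, val = chosen[:n_train], chosen[n_train:]

    # Everything not used as a seed mapping is left for testing.
    chosen_set = set(chosen)
    test = [pair for pair in links if pair not in chosen_set]
    return train, val, test
```

Because the test split is simply the remainder, it keeps the natural mix of easy and hard links, while train/val contain only links that satisfy the chosen bias.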

To check the validity of any to-be-used benchmark, please run check_benchmark.py to verify the benchmark format.

To generate/sample biased seed mappings from the to-be-used benchmark, please run sample_benchmark.py and then check the train/val/test splits in the target_root_dir defined in config.json.

To reproduce the experimental results in our paper, please refer to OpenEA to run the experiments based on the biased seed mappings (mentioned above).

examples

We list some statistics of sampled biased seed mappings as follows.

<D_W_15K_V2> train_ratio: 0.02 val_ratio: 0.01
train-name-bias-stats:   same1.00  close0.00  diff0.00
train-attr-bias-stats:   large1.00  mid0.00  small0.00
val-name-bias-stats:   same1.00  close0.00  diff0.00
val-attr-bias-stats:   large1.00  mid0.00  small0.00
test-name-bias-stats:   same0.32  close0.21  diff0.47
test-attr-bias-stats:   large0.57  mid0.32  small0.10

When biased seed mappings are sampled from the public benchmark D_W_15K_V2 under the industry setting (both name-biased and attribute-biased), the train/val splits contain only biased seed mappings, i.e. entity pairs that share the same name and have a large number of attributes. The test split, in contrast, keeps a mix of same, close, and different names.
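To make the same/close/diff proportions above more concrete, the following sketch shows one way such name-bias statistics could be computed for a split. The README does not define "close", so the use of `difflib.SequenceMatcher` and the `close_threshold` value are assumptions for illustration only, as is the function name; the repository's own statistics may be computed differently.

```python
from collections import Counter
from difflib import SequenceMatcher


def name_bias_stats(name_pairs, close_threshold=0.5):
    """Return the share of same / close / diff name pairs in a split.

    name_pairs: list of (name_in_KG1, name_in_KG2) for aligned entities
    close_threshold: assumed similarity cut-off for "close" (hypothetical)
    """
    counts = Counter()
    for n1, n2 in name_pairs:
        if n1 == n2:
            counts["same"] += 1
        elif SequenceMatcher(None, n1, n2).ratio() >= close_threshold:
            counts["close"] += 1
        else:
            counts["diff"] += 1
    total = len(name_pairs)
    if total == 0:
        return {"same": 0.0, "close": 0.0, "diff": 0.0}
    return {key: counts[key] / total for key in ("same", "close", "diff")}
```

Under this reading, a split sampled with name bias would report `same` close to 1.00, as in the train/val rows above.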

Benchmark

We extracted an industry benchmark, named MED-BBK-9K, from two real-world medical KGs for alignment; it can be found here.

industry.zip
|__ attr_triples_1          # attribute triples of KG1
|__ attr_triples_2          # attribute triples of KG2
|__ ent_links               # entity links between KGs (ground-truth)
|__ rel_triples_1           # relation triples of KG1
|__ rel_triples_2           # relation triples of KG2

The statistics of the industry benchmark are listed below. D_W_15K_V2 is included for comparison.

Benchmark	KGs	#Ents	#Rels	#Rel triples	Rel degree	#Attrs	#Attr triples	Attr degree
MED-BBK-9K	MED	9,162	32	158,357	34.04	19	11,467	1.24
MED-BBK-9K	BBK	9,162	20	50,307	10.96	21	44,987	4.91
D-W-15K	DBpedia	15,000	167	73,983	8.55	175	66,813	4.40
D-W-15K	Wikidata	15,000	121	83,365	10.31	457	175,686	11.59

examples

We list some fragments of our industry benchmark as follows.

ent_links

<月经异常>\t<月经不调>
<弓形体病性巩膜炎>\t<弓形虫病性巩膜炎>
<巨趾症>\t<巨趾症>
<发细菌感染>\t<a40292>
<脑溢血后遗症>\t<脑出血后遗症>

rel_triples

<骨关节病>\t<典型症状>\t<僵硬>
<额区感觉减退>\t<相关疾病>\t<下肢动脉硬化闭塞症>
<绦虫病>\t<典型症状>\t<恶心>
<胆汁返流性胃炎>\t<典型症状>\t<反酸>
<脐炎>\t<典型症状>\t<发热>

attr_triples

<病毒性食管炎>\t<英文名>\t<viralesophagitis>
<碱中毒>\t<临床表现>\t<它是呼吸系统对碱中毒的代偿现象，借助于浅而慢的呼吸，得以增加肺泡内的pco，使[bhco] [hhco]的分母加大，以减少因分子变大而发生的比值改变（稳定ph值）。躁动、兴奋、谵语、嗜睡、严重时昏迷。有手足搐搦，腱反射亢进等。如已发生钾缺乏，可能出现酸性尿的矛盾现象，应特别注意。标准碳酸氢（sb）、实际碳酸氢（ab）、缓冲碱（bb）、碱剩余（be）增加，血液paco、血液ph值升高。>
<十二指肠溃疡>\t<就诊科室>\t<消化内科>
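Judging from the fragments above, each line of ent_links holds two tab-separated fields and each line of the triple files holds three. A minimal format check along those lines might look like the sketch below. It is not the repository's check_benchmark.py; the function names and the per-file field counts are assumptions inferred from the examples, and it assumes that `\t` in the fragments stands for a real tab character and that values contain no tabs themselves.

```python
from pathlib import Path

# Expected number of tab-separated fields per file (inferred from the fragments).
EXPECTED_FIELDS = {
    "ent_links": 2,
    "rel_triples_1": 3,
    "rel_triples_2": 3,
    "attr_triples_1": 3,
    "attr_triples_2": 3,
}


def check_lines(lines, n_fields):
    """Return (line_number, line) for every non-empty line with the wrong field count."""
    bad = []
    for number, line in enumerate(lines, start=1):
        line = line.rstrip("\n")
        if not line:
            continue
        if len(line.split("\t")) != n_fields:
            bad.append((number, line))
    return bad


def check_benchmark(root):
    """Map each expected file to "missing" or to its list of malformed lines."""
    report = {}
    for name, n_fields in EXPECTED_FIELDS.items():
        path = Path(root) / name
        if not path.is_file():
            report[name] = "missing"
            continue
        with path.open(encoding="utf-8") as handle:
            report[name] = check_lines(handle, n_fields)
    return report
```

Calling `check_benchmark` on the directory extracted from industry.zip should then yield an empty list for every file if the format matches these assumptions.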

Citation

If you have any difficulty or questions running the code or reproducing the experimental results, please email [email protected]

If you use this model or code, please cite it as follows:

@inproceedings{zhang2020industry,
  title={An Industry Evaluation of Embedding-based Entity Alignment},
  author={Zhang, Ziheng and Liu, Hualuo and Chen, Jiaoyan and Chen, Xi and Liu, Bo and Xiang, YueJia and Zheng, Yefeng},
  booktitle={Proceedings of the 28th International Conference on Computational Linguistics: Industry Track},
  pages={179--189},
  year={2020}
}
