All Projects → wangmengsd → pdd-graph

wangmengsd / pdd-graph

Licence: other
PDD Graph : Bridging MIMIC-III and Linked Data Cloud

Programming Languages

CSS
56736 projects

Projects that are alternatives of or similar to pdd-graph

BadMedicine
Library and CLI for randomly generating medical data like you might get out of an Electronic Health Records (EHR) system
Stars: ✭ 18 (-41.94%)
Mutual labels:  patient, ehr
freehealth
Free and open source Electronic Health Record
Stars: ✭ 39 (+25.81%)
Mutual labels:  emr, ehr
Openemr
The most popular open source electronic health records and medical practice management solution.
Stars: ✭ 1,762 (+5583.87%)
Mutual labels:  emr, ehr
app
Aplicación web para ANDES
Stars: ✭ 12 (-61.29%)
Mutual labels:  emr
biograkn
BioGrakn Knowledge Graph
Stars: ✭ 169 (+445.16%)
Mutual labels:  biomedical
Hello-AWS-Data-Services
Sample code for AWS data service and ML courses on LinkedIn Learning
Stars: ✭ 144 (+364.52%)
Mutual labels:  emr
halyos
Redesigning the Patient Portal Experience with SMART on FHIR.
Stars: ✭ 20 (-35.48%)
Mutual labels:  ehr
GEANet-BioMed-Event-Extraction
Code for the paper Biomedical Event Extraction with Hierarchical Knowledge Graphs
Stars: ✭ 52 (+67.74%)
Mutual labels:  biomedical
ICU4N
International Components for Unicode for .NET
Stars: ✭ 18 (-41.94%)
Mutual labels:  icu
terraform-emr-spark-example
An example Terraform project that will configure a Secure and Customizable Spark Cluster on Amazon EMR.
Stars: ✭ 43 (+38.71%)
Mutual labels:  emr
loinc2hpo
Java library to map LOINC-encoded test results to Human Phenotype Ontology
Stars: ✭ 19 (-38.71%)
Mutual labels:  ehr
build-scripts
Utility scripts for building of 3rd-party libraries
Stars: ✭ 33 (+6.45%)
Mutual labels:  icu
trove
Weakly supervised medical named entity classification
Stars: ✭ 55 (+77.42%)
Mutual labels:  biomedical
rail
Scalable RNA-seq analysis
Stars: ✭ 74 (+138.71%)
Mutual labels:  emr
scholia
Wikidata-based scholarly profiles
Stars: ✭ 166 (+435.48%)
Mutual labels:  linked-open-data
neji
Flexible and powerful platform for biomedical information extraction from text
Stars: ✭ 37 (+19.35%)
Mutual labels:  biomedical
ehr-blockchain
Electronic Health Record (EHR) and Electronic Medical Record (EMR) systems. However, they still face some issues regarding the security of medical records, user ownership of data, data integrity etc. The solution to these issues could be the use of a novel technology, i.e., Blockchain. This technology offers to provide a secure, temper-proof pl…
Stars: ✭ 41 (+32.26%)
Mutual labels:  ehr
numishare
Numishare is an open source suite of applications for managing digital cultural heritage artifacts, with a particular focus on coins and medals.
Stars: ✭ 29 (-6.45%)
Mutual labels:  linked-open-data
sbt-lighter
SBT plugin for Apache Spark on AWS EMR
Stars: ✭ 57 (+83.87%)
Mutual labels:  emr
MIRACL
Multi-modal Image Registration And Connectivity anaLysis
Stars: ✭ 23 (-25.81%)
Mutual labels:  biomedical

PDD Graph: Patient-Disease-Drug Graph, Bridging MIMIC-III and Linked Data Cloud

The latest news

We just released the new version 1.4! Go

Note

If you use PDD Graph data in your work, please cite the following publication:

  • Wang, M., Zheng, J., Liu, J., Hu, W., Wang, S., Li, X., & Liu, W. (2017, October). PDD Graph: Bridging Electronic Medical Records and Biomedical Knowledge Graphs via Entity Linking. In International Semantic Web Conference (ISWC). Springer, Cham, 2017: 219-227.

We ask that users who download significant portions of the database cite the MIMIC-III paper in any resulting publications.

  • MIMIC-III, a freely accessible critical care database. Johnson AEW, Pollard TJ, Shen L, Lehman L, Feng M, Ghassemi M, Moody B, Szolovits P, Celi LA, and Mark RG. Scientific Data (2016). DOI: 10.1038/sdata.2016.35.

Introduction

The gap between clinical data and biomedical knowledge graphs:

  • An EMR database, MIMIC-III: contains multi-format electronic data but remains limitations in scope.
  • Biomedical KGs: cover basic medical facts, but contain little information about clinical outcomes.

The gap between clinical data and biomedical KGs prohibits further exploring medical entity relationships on ether side, as shown in the following figure:

What is PDD Graph (Patient-Disease-Drug Graph):

Electronic medical records contain multi-format electronic medical data that consist of an abundance of medical knowledge. Facing with patients symptoms, experienced caregivers make right medical decisions based on their professional knowledge that accurately grasps relationships between symptoms, diagnosis, and treatments. We aim to capture these relationships by constructing a large and high-quality heterogeneous graph linking patients, diseases, and drugs (PDD) in EMRs.

Specifically, we extract important medical entities from MIMIC-III (Medical Information Mart for Intensive Care III) and automatically link them with the existing biomedical knowledge graphs, including ICD-9 ontology and DrugBank. The PDD graph presented is accessible on the Web via the SPARQL endpoint, and provides a pathway for medical discovery and applications, such as effective treatment recommendations.

A subgraph of PDD is illustrated in the following figure to better understand the PDD graph.

Download

You can download it from different places. We offer two different formats in this site .

Format 1:N-Triples

RDF data files ended by .nt.

Format 2:Apache Jena

Formatted by Apache Jena TDB , you can user it in Jena API.

Download elsewhere

Friendly Link

Our data uses other resources, so we make a statement here.

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Contact

  • Meng Wang [email protected]
  • Lei Shi [email protected]
  • Update

    • V1.4

      In the new version 1.4, we add detailed information about the prescriptions, including the dosage, the duration and so on. These information will be applied to conveniently retrieve the exact adverse drug combinations taken by corresponding patients.

      A specific example please refer to the Tutorial SPARQL Query Example5

    • V1.3

      We have attached DDI triples in the latest version 1.3. These DDI triples are extracted from DrugBank and will be applied to conveniently retrieve the possible adverse drug combinations taken by corresponding patients.

    • V1.2

      Fix the bugs in "diagnose_icd_information.nt".

      In the new version, we have eliminated an engineering bug that was made when label matching of ICD-9 codes. This bug results in the linking failures of 380 diseases in MIMIC-III.

      For diseases in the latest PDD version, the overall number of diseases is 6985, and 6,983 diseases are connected to ICD-9 ontology. The only two failed matching codes are '71970' and 'NULL', which are not included in ICD-9 ontology.

    • V1.1

      Add Patient BMI data.

    More Information

    Github Pages

    Website

    DataHub URL

    Data dump

    SPARQL endpoint

    Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].