All Projects → razorx89 → roco-dataset

razorx89 / roco-dataset

Licence: other
Radiology Objects in COntext (ROCO): A Multimodal Image Dataset

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to roco-dataset

CODER
CODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]
Stars: ✭ 24 (-36.84%)
Mutual labels:  medical, umls
rid-covid
Image-based COVID-19 diagnosis. Links to software, data, and other resources.
Stars: ✭ 74 (+94.74%)
Mutual labels:  medical, radiology
DICOM.jl
Julia package for reading and writing DICOM (Digital Imaging and Communications in Medicine) files
Stars: ✭ 45 (+18.42%)
Mutual labels:  medical, radiology
wolfpacs
WolfPACS is an DICOM load balancer written in Erlang.
Stars: ✭ 1 (-97.37%)
Mutual labels:  medical, radiology
ParseCareKit
Securely synchronize any CareKit 2.1+ based app to a Parse Server Cloud. Compatible with parse-hipaa.
Stars: ✭ 28 (-26.32%)
Mutual labels:  medical
gnn-re-ranking
A real-time GNN-based method. Understanding Image Retrieval Re-Ranking: A Graph Neural Network Perspective
Stars: ✭ 64 (+68.42%)
Mutual labels:  image-retrieval
Ranked-List-Loss-for-DML
CVPR 2019: Ranked List Loss for Deep Metric Learning, with extension for TPAMI submission
Stars: ✭ 56 (+47.37%)
Mutual labels:  image-retrieval
street2shopTriplet
Challenge the customer to shop task with tripletNet
Stars: ✭ 14 (-63.16%)
Mutual labels:  image-retrieval
SOLAR
PyTorch code for "SOLAR: Second-Order Loss and Attention for Image Retrieval". In ECCV 2020
Stars: ✭ 150 (+294.74%)
Mutual labels:  image-retrieval
XCloud
Official Code for Paper <XCloud: Design and Implementation of AI Cloud Platform with RESTful API Service> (arXiv1912.10344)
Stars: ✭ 58 (+52.63%)
Mutual labels:  image-retrieval
Emory-BMI-GSoC
Emory BMI GSoC Project Ideas
Stars: ✭ 27 (-28.95%)
Mutual labels:  radiology
dicom-standard-chinese
Chinese translation of DICOM standard, DICOM协议中文版
Stars: ✭ 26 (-31.58%)
Mutual labels:  medical
PCLoc
Pose Correction for Highly Accurate Visual Localization in Large-scale Indoor Spaces (ICCV 2021)
Stars: ✭ 37 (-2.63%)
Mutual labels:  image-retrieval
eye-tracker-setup
👀 Tobii Eye Tracker 4C Setup
Stars: ✭ 24 (-36.84%)
Mutual labels:  radiology
towhee
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Stars: ✭ 821 (+2060.53%)
Mutual labels:  image-retrieval
covid-19-image-repository
Anonymized dataset of COVID-19 cases with a focus on radiological imaging. This includes images (x-ray / ct) with extensive metadata, such as admission-, ICU-, laboratory-, and patient master-data.
Stars: ✭ 42 (+10.53%)
Mutual labels:  radiology
fuse-med-ml
A python framework accelerating ML based discovery in the medical field by encouraging code reuse. Batteries included :)
Stars: ✭ 66 (+73.68%)
Mutual labels:  medical
metamaplite
A near real-time named-entity recognizer
Stars: ✭ 37 (-2.63%)
Mutual labels:  umls
mildnet
Visual Similarity research at Fynd. Contains code to reproduce 2 of our research papers.
Stars: ✭ 76 (+100%)
Mutual labels:  image-retrieval
trove
Weakly supervised medical named entity classification
Stars: ✭ 55 (+44.74%)
Mutual labels:  umls

Radiology Objects in COntext (ROCO): A Multimodal Image Dataset

This repository contains the Radiology Objects in COntext (ROCO) dataset, a large-scale medical and multimodal imaging dataset. The listed images are from publications available on the PubMed Central Open Access FTP mirror, which were automatically detected as non-compound and either radiology or non-radiology. Each image is distributed as a download link, together with its caption. Additionally, keywords extracted from the image caption, as well as the corresponding UMLS Semantic Types (SemTypes) and UMLS Concept Unique Identifiers (CUIs) are available. The dataset could be used to build generative models for image captioning, classification models for image categorization and tagging or content-based image retrieval systems.

A subset of the ROCO dataset is used as development data for the Concept Detection Task at ImageCLEF 2019. Further information regarding the task description, submission dates, and publication opportunities can be found at ImageCLEFmed Caption 2019 and CrowdAI.

Get started

To download the images, clone the repository and run

python scripts/fetch.py

Troubleshooting

If you see many download errors and/or you have a slow internet connection, try to reduce the number of processes to one:

python scripts/fetch.py -n 1

If on Windows, make sure wget is installed and its location (e.g. C:\Program Files (x86)\GnuWin32\bin) is added to the Path environment variable. Or install Ubuntu on WSL.

Citation

If you use this dataset for your own research, please cite the following paper:

O. Pelka, S. Koitka, J. Rückert, F. Nensa, C.M. Friedrich,
"Radiology Objects in COntext (ROCO): A Multimodal Image Dataset".
MICCAI Workshop on Large-scale Annotation of Biomedical Data and Expert Label Synthesis (LABELS) 2018, September 16, 2018, Granada, Spain. Lecture Notes on Computer Science (LNCS), vol. 11043, pp. 180-189, Springer Cham, 2018.
doi: 10.1007/978-3-030-01364-6_20

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].