Radiology Objects in COntext (ROCO): A Multimodal Image Dataset

This repository contains the Radiology Objects in COntext (ROCO) dataset, a large-scale multimodal medical imaging dataset. The listed images come from publications available on the PubMed Central Open Access FTP mirror and were automatically detected as non-compound and classified as either radiology or non-radiology. Each image is distributed as a download link together with its caption. Keywords extracted from the image caption are also available, along with the corresponding UMLS Semantic Types (SemTypes) and UMLS Concept Unique Identifiers (CUIs). The dataset can be used to build generative models for image captioning, classification models for image categorization and tagging, or content-based image retrieval systems.
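As a rough illustration of how the per-image annotations described above might be combined, the sketch below joins captions with CUIs by image identifier. The file layout is an assumption made for this example only: one tab-separated line per image, with the identifier first and the values after it. The function names `read_id_value_pairs` and `merge_annotations` are hypothetical and not part of this repository, and the sample identifiers and CUIs are placeholders.

```python
import csv


def read_id_value_pairs(lines):
    """Parse tab-separated lines into {image_id: [value, ...]}.

    Assumed layout (not confirmed by the dataset documentation):
    <image_id> TAB <value> [TAB <value> ...]
    """
    records = {}
    for row in csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE):
        if not row:
            continue  # skip blank lines
        image_id = row[0].strip()
        records[image_id] = [v.strip() for v in row[1:] if v.strip()]
    return records


def merge_annotations(captions, cuis):
    """Combine captions and CUIs into one record per image identifier."""
    merged = {}
    for image_id, caption_fields in captions.items():
        merged[image_id] = {
            "caption": " ".join(caption_fields),
            # Images without concept annotations get an empty list.
            "cuis": cuis.get(image_id, []),
        }
    return merged
```

Both functions accept any iterable of text lines, so an open file handle can be passed directly once the real file layout has been checked.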

A subset of the ROCO dataset is used as development data for the Concept Detection Task at ImageCLEF 2019. Further information regarding the task description, submission dates, and publication opportunities can be found at ImageCLEFmed Caption 2019 and CrowdAI.

Get started

To download the images, clone the repository and run:

python scripts/fetch.py

Troubleshooting

If you see many download errors or have a slow internet connection, try reducing the number of processes to one:

python scripts/fetch.py -n 1

On Windows, make sure wget is installed and that its location (e.g. C:\Program Files (x86)\GnuWin32\bin) is added to the Path environment variable. Alternatively, install Ubuntu on WSL and run the script there.
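The checks above can also be scripted. The following is a minimal sketch, assuming the fetch script relies on wget being reachable through the PATH (as the Windows note suggests) and that it is started from the repository root. The helper names `wget_available`, `build_fetch_command`, and `run_fetch` are hypothetical; only `scripts/fetch.py` and its `-n` option come from this README.

```python
import shutil
import subprocess
import sys


def wget_available():
    """Return True if a wget executable can be found on the PATH."""
    return shutil.which("wget") is not None


def build_fetch_command(num_processes=None):
    """Build the command line for scripts/fetch.py.

    num_processes maps to the -n option; None keeps the script's default.
    """
    command = [sys.executable, "scripts/fetch.py"]
    if num_processes is not None:
        command += ["-n", str(num_processes)]
    return command


def run_fetch(num_processes=None):
    """Run the fetch script from the repository root; return its exit code."""
    if not wget_available():
        raise RuntimeError("wget was not found on the PATH; install it first")
    return subprocess.run(build_fetch_command(num_processes)).returncode
```

Calling `run_fetch(1)` corresponds to the single-process invocation shown above, with the wget check performed first.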

Citation

If you use this dataset for your own research, please cite the following paper:

O. Pelka, S. Koitka, J. Rückert, F. Nensa, C.M. Friedrich,
"Radiology Objects in COntext (ROCO): A Multimodal Image Dataset".
MICCAI Workshop on Large-scale Annotation of Biomedical Data and Expert Label Synthesis (LABELS) 2018, September 16, 2018, Granada, Spain. Lecture Notes in Computer Science (LNCS), vol. 11043, pp. 180-189, Springer, Cham, 2018.
doi: 10.1007/978-3-030-01364-6_20
