Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → UCSD-AI4H → Covid Ct

UCSD-AI4H / Covid Ct

COVID-CT-Dataset: A CT Scan Dataset about COVID-19

Labels

jupyter-notebook deep-learning computer-vision dataset

Projects that are alternatives of or similar to Covid Ct

Data Science Hacks

Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.

Stars: ✭ 273 (-66.71%)

Mutual labels: jupyter-notebook, dataset

Transportationnetworks

Transportation Networks for Research

Stars: ✭ 312 (-61.95%)

Mutual labels: jupyter-notebook, dataset

A python package to access tsetmc data

Stars: ✭ 282 (-65.61%)

Mutual labels: jupyter-notebook, dataset

Covid Chestxray Dataset

We are building an open database of COVID-19 cases with chest X-ray or CT images.

Stars: ✭ 2,759 (+236.46%)

Mutual labels: jupyter-notebook, dataset

VPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition (ICCV 2017)

Stars: ✭ 382 (-53.41%)

Mutual labels: jupyter-notebook, dataset

🌮 Trash Annotations in Context Dataset Toolkit

Stars: ✭ 243 (-70.37%)

Mutual labels: jupyter-notebook, dataset

Covid19 twitter

Covid-19 Twitter dataset for non-commercial research use and pre-processing scripts - under active development

Stars: ✭ 304 (-62.93%)

Mutual labels: jupyter-notebook, dataset

Tutorial: Web scraping in Python with Beautiful Soup

Stars: ✭ 201 (-75.49%)

Mutual labels: jupyter-notebook, dataset

[ISBI'21] MedMNIST Classification Decathlon: A Lightweight AutoML Benchmark for Medical Image Analysis

Stars: ✭ 338 (-58.78%)

Mutual labels: jupyter-notebook, dataset

Dsprites Dataset

Dataset to assess the disentanglement properties of unsupervised learning methods

Stars: ✭ 340 (-58.54%)

Mutual labels: jupyter-notebook, dataset

source{d} datasets ("big code") for source code analysis and machine learning on source code

Stars: ✭ 231 (-71.83%)

Mutual labels: jupyter-notebook, dataset

Hate Speech And Offensive Language

Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017

Stars: ✭ 543 (-33.78%)

Mutual labels: jupyter-notebook, dataset

A benchmark dataset for data-driven weather forecasting

Stars: ✭ 227 (-72.32%)

Mutual labels: jupyter-notebook, dataset

The ApolloScape Open Dataset for Autonomous Driving and its Application.

Stars: ✭ 260 (-68.29%)

Mutual labels: jupyter-notebook, dataset

Coronavirus COVID-19 (2019-nCoV) Data Repository and Dashboard for South Africa

Stars: ✭ 208 (-74.63%)

Mutual labels: jupyter-notebook, dataset

Datascience course

Curso de Data Science em Português

Stars: ✭ 294 (-64.15%)

Mutual labels: jupyter-notebook, dataset

Data Science Resources

👨🏽‍🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋

Stars: ✭ 171 (-79.15%)

Mutual labels: jupyter-notebook, dataset

Fifa18 All Player Statistics

A complete catalog of all the players in Fifa 18 and their complete statistics.

Stars: ✭ 185 (-77.44%)

Mutual labels: jupyter-notebook, dataset

Profile and monitor your ML data pipeline end-to-end

Stars: ✭ 328 (-60%)

Mutual labels: jupyter-notebook, dataset

A driving dataset for the development and validation of fused pose estimators and mapping algorithms

Stars: ✭ 391 (-52.32%)

Mutual labels: jupyter-notebook, dataset

View All Similar Projects ➔

COVID-CT

The utility of this dataset has been confirmed by a senior radiologist in Tongji Hospital, Wuhan, China, who has performed diagnosis and treatment of a large number of COVID-19 patients during the outbreak of this disease between January and April.

After releasing this dataset, we received several feedbacks expressing concerns about the usability of this dataset. The major concerns are summarized as follows. First, when the original CT images are put into papers, the quality of these images is degraded, which may render the diagnosis decisions less accurate. The quality degradation includes: the Hounsfield unit (HU) values are lost; the number of bits per pixel is reduced; the resolution of images is reduced. Second, the original CT scan contains a sequence of CT slices, but when put into papers, only a few key slices are selected, which may have negative impact on diagnosis as well.

We consulted the aforementioned radiologist at Tongji Hospital regarding these two concerns. According to the radiologist, the issues raised in these concerns do not significantly affect the accuracy of diagnosis decision-making. First, experienced radiologists are able to make an accurate diagnosis from low quality CT images. For example, given a photo taken by smart phone of the original CT image, experienced radiologists can make an accurate diagnosis by just looking at the photo, though the CT image in the photo has much lower quality than the original CT image. Likewise, the quality gap between CT images in papers and original CT images will not largely hurt the accuracy of diagnosis. Second, while it is preferable to read a sequence of CT slices, oftentimes a single-slice of CT contains enough clinical information for accurate decision-making.

Data Description

The COVID-CT-Dataset has 349 CT images containing clinical findings of COVID-19 from 216 patients. They are in ./Images-processed/CT_COVID.zip

Non-COVID CT scans are in ./Images-processed/CT_NonCOVID.zip

We provide a data split in ./Data-split. Data split information see README for DenseNet_predict.md

The meta information (e.g., patient ID, patient information, DOI, image caption) is in COVID-CT-MetaInfo.xlsx

The images are collected from COVID19-related papers from medRxiv, bioRxiv, NEJM, JAMA, Lancet, etc. CTs containing COVID-19 abnormalities are selected by reading the figure captions in the papers. All copyrights of the data belong to the authors and publishers of these papers.

The dataset details are described in this preprint: COVID-CT-Dataset: A CT Scan Dataset about COVID-19

If you find this dataset and code useful, please cite:

@article{zhao2020COVID-CT-Dataset,
  title={COVID-CT-Dataset: a CT scan dataset about COVID-19},
  author={Zhao, Jinyu and Zhang, Yichen and He, Xuehai and Xie, Pengtao},
  journal={arXiv preprint arXiv:2003.13865}, 
  year={2020}
}

Baseline Performance

We developed two baseline methods for the community to benchmark with. The code are in the "baseline methods" folder and the details are in the readme files under that folder. The methods are described in Sample-Efficient Deep Learning for COVID-19 Diagnosis Based on CT Scans

If you find the code useful, please cite:

@Article{he2020sample,
  author  = {He, Xuehai and Yang, Xingyi and Zhang, Shanghang, and Zhao, Jinyu and Zhang, Yichen and Xing, Eric, and Xie,       Pengtao},
  title   = {Sample-Efficient Deep Learning for COVID-19 Diagnosis Based on CT Scans},
  journal = {medrxiv},
  year    = {2020},
}

Contribution Guide

To contribute to our project, please email your data to [email protected] with the corresponding meta information (Patient ID, DOI and Captions).
We recommend you also extract images from publications or preprints. Make sure the original papers you crawled have different DOIs from those listed in COVID-CT-MetaInfo.xlsx.
In COVID-CT-MetaInfo.xlsx, images with the form of 2020.mm.dd.xxxx are crawled from bioRxiv or medRxiv. The DOIs for these preprints are 10.1101/2020.mm.dd.xxxx.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 820

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (6) 🔗