All Projects → usuyama → ePillID-benchmark

usuyama / ePillID-benchmark

Licence: MIT license
ePillID Dataset: A Low-Shot Fine-Grained Benchmark for Pill Identification (CVPR 2020 VL3)

Programming Languages

Jupyter Notebook
11667 projects
python
139335 projects - #7 most used programming language
shell
77523 projects
Dockerfile
14818 projects

Projects that are alternatives of or similar to ePillID-benchmark

Catalyst
Accelerated deep learning R&D
Stars: ✭ 2,804 (+5092.59%)
Mutual labels:  image-classification, metric-learning
CoreML-and-Vision-with-a-pre-trained-deep-learning-SSD-model
This project shows how to use CoreML and Vision with a pre-trained deep learning SSD (Single Shot MultiBox Detector) model. There are many variations of SSD. The one we’re going to use is MobileNetV2 as the backbone this model also has separable convolutions for the SSD layers, also known as SSDLite. This app can find the locations of several di…
Stars: ✭ 16 (-70.37%)
Mutual labels:  image-classification
ML4K-AI-Extension
Use machine learning in AppInventor, with easy training using text, images, or numbers through the Machine Learning for Kids website.
Stars: ✭ 18 (-66.67%)
Mutual labels:  image-classification
deep-parking
Code to reproduce 'Deep Learning for Decentralized Parking Lot Occupancy Detection' paper.
Stars: ✭ 81 (+50%)
Mutual labels:  image-classification
image-classification
A collection of SOTA Image Classification Models in PyTorch
Stars: ✭ 70 (+29.63%)
Mutual labels:  image-classification
SPML
Universal Weakly Supervised Segmentation by Pixel-to-Segment Contrastive Learning
Stars: ✭ 81 (+50%)
Mutual labels:  metric-learning
image-recognition
采用深度学习方法进行刀具识别。
Stars: ✭ 19 (-64.81%)
Mutual labels:  image-classification
CareCards
This mighty little deck of cards will change your health habits.
Stars: ✭ 25 (-53.7%)
Mutual labels:  healthcare
FNet-pytorch
Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms
Stars: ✭ 204 (+277.78%)
Mutual labels:  image-classification
pytorch-vit
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Stars: ✭ 250 (+362.96%)
Mutual labels:  image-classification
ProtoTree
ProtoTrees: Neural Prototype Trees for Interpretable Fine-grained Image Recognition, published at CVPR2021
Stars: ✭ 47 (-12.96%)
Mutual labels:  fine-grained-visual-categorization
clara-dicom-adapter
DICOM Adapter is a component of the Clara Deploy SDK which facilitates integration with DICOM compliant systems, enables ingestion of imaging data, helps triggering of jobs with configurable rules and offers pushing the output of jobs to PACS systems.
Stars: ✭ 31 (-42.59%)
Mutual labels:  healthcare
classification
Catalyst.Classification
Stars: ✭ 35 (-35.19%)
Mutual labels:  image-classification
Poke-Pi-Dex
Our deep learning for computer vision related project for nostalgic poke weebs (Sistemi digitali, Unibo).
Stars: ✭ 18 (-66.67%)
Mutual labels:  image-classification
MinkLoc3D
MinkLoc3D: Point Cloud Based Large-Scale Place Recognition
Stars: ✭ 83 (+53.7%)
Mutual labels:  metric-learning
awesome-list-machine-learning-healthcare
A list of awesome resources on the application of machine learning in healthcare
Stars: ✭ 17 (-68.52%)
Mutual labels:  healthcare
Deploy-ML-model
No description or website provided.
Stars: ✭ 57 (+5.56%)
Mutual labels:  image-classification
recipe1m.bootstrap.pytorch
Retrieve recipes from foodie pictures using Deep Learning and Pytorch
Stars: ✭ 42 (-22.22%)
Mutual labels:  metric-learning
open-source-cure-for-cancer
Using AI to figure out cancer
Stars: ✭ 45 (-16.67%)
Mutual labels:  healthcare
CNN-Image-Classifier
A Simple Deep Neural Network to classify images made with Keras
Stars: ✭ 65 (+20.37%)
Mutual labels:  image-classification

ePillID Benchmark Open In Colab

PWC

This repository contains data and code for ePillID - a benchmark for developing and evaluating computer vision models for pill identification. The ePillID benchmark is designed as a low-shot fine-grained benchmark, reflecting real-world challenges for developing image-based pill identification systems.

The characteristics of the ePillID benchmark include:

  • Reference and consumer images: The reference images are taken with controlled lighting and backgrounds, and with professional equipment. The consumer images are taken with real-world settings including different lighting, backgrounds, and equipment. For most of the pills, one image per side (two images per pill type) is available from the NIH Pillbox dataset.

Examples of reference and consumer images in ePillID

  • Low-shot and fine-grained setting: 13k images representing 9804 appearance classes (two sides for 4902 pill types). For most of the appearance classes, there exists only one reference image, making it a challenging low-shot recognition setting.

ePillID data distribution

  • The best performing model at the time of publication is a multi-head metric learning approach. Following a CNN encoder, a bilinear transformation layer is used to represent fine-grained embeddings.

Model overview

Please refer to our paper for more details:

@inproceedings{usuyama2020epillid,
  title={ePillID Dataset: A Low-Shot Fine-Grained Benchmark for Pill Identification},
  author={Usuyama, Naoto and Delgado, Natalia Larios and Hall, Amanda K and Lundin, Jessica},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops},
  year={2020}
}

Data

Downloadable from the releases page.

Docker

Docker image https://hub.docker.com/r/naotous/pytorch-image

sudo nvidia-docker pull naotous/pytorch-image:py36torch041-legacy

Train and evaluate models

sudo nvidia-docker run --ipc=host -it -v $HOME:$HOME naotous/pytorch-image:py36torch041-legacy /bin/bash

use -v mount option to access folders from Docker containers

source activate py36

python train_cv.py --appearance_network resnet18 --max_epochs 10 --data_root_dir /home/naotous/ePillID_data

NOTE: max_epochs 10 for quick testing

Train and evaluate models on Azure ML

python azureml_submit_runs.py ePillID_0430

Notes and acknowledgments

The ePillID dataset includes customized data from two public NIH NLM databases. Please refer to their websites for original data:

For bilinear pooling implementations, we forked fast-MPN-Cov GitHub repo.

Please consider citing their work as well when you find our work useful in your research.

Disclaimer

This dataset and software are released for research purposes only.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].