All Projects β†’ spinda β†’ Imagenetscraper

spinda / Imagenetscraper

Licence: agpl-3.0
πŸ‘ Bulk-download all thumbnails from an ImageNet synset, with optional rescaling

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Imagenetscraper

Cryptocmd
Cryptocurrency historical price data library in Python. Data from https://coinmarketcap.com.
Stars: ✭ 299 (+1145.83%)
Mutual labels:  dataset, scraper
Imagenetv2
A new test set for ImageNet
Stars: ✭ 109 (+354.17%)
Mutual labels:  dataset, imagenet
Label Studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Stars: ✭ 7,264 (+30166.67%)
Mutual labels:  dataset, imagenet
Mini Imagenet Tools
Tools for generating mini-ImageNet dataset and processing batches
Stars: ✭ 209 (+770.83%)
Mutual labels:  dataset, imagenet
Tensorflow object tracking video
Object Tracking in Tensorflow ( Localization Detection Classification ) developed to partecipate to ImageNET VID competition
Stars: ✭ 491 (+1945.83%)
Mutual labels:  dataset, imagenet
Cvat
Powerful and efficient Computer Vision Annotation Tool (CVAT)
Stars: ✭ 6,557 (+27220.83%)
Mutual labels:  dataset, imagenet
Caffenet Benchmark
Evaluation of the CNN design choices performance on ImageNet-2012.
Stars: ✭ 700 (+2816.67%)
Mutual labels:  dataset, imagenet
Covid Ct
COVID-CT-Dataset: A CT Scan Dataset about COVID-19
Stars: ✭ 820 (+3316.67%)
Mutual labels:  dataset
Football Data
football (soccer) datasets
Stars: ✭ 18 (-25%)
Mutual labels:  dataset
Osint collection
Maintained collection of OSINT related resources. (All Free & Actionable)
Stars: ✭ 809 (+3270.83%)
Mutual labels:  dataset
Switchable Normalization
Code for Switchable Normalization from "Differentiable Learning-to-Normalize via Switchable Normalization", https://arxiv.org/abs/1806.10779
Stars: ✭ 804 (+3250%)
Mutual labels:  imagenet
Insuranceqa Corpus Zh
🚁 δΏι™©θ‘ŒδΈšθ―­ζ–™εΊ“οΌŒθŠε€©ζœΊε™¨δΊΊ
Stars: ✭ 821 (+3320.83%)
Mutual labels:  dataset
Find Cheapest Flights
Use Google Flights API and scrape Expedia to find the cheapest/shortest flights!
Stars: ✭ 18 (-25%)
Mutual labels:  scraper
Datastream.io
An open-source framework for real-time anomaly detection using Python, ElasticSearch and Kibana
Stars: ✭ 814 (+3291.67%)
Mutual labels:  dataset
Brainly Scraper
package to scrape www.brainly.co.id
Stars: ✭ 22 (-8.33%)
Mutual labels:  scraper
Safety Helmet Wearing Dataset
Safety helmet wearing detect dataset, with pretrained model
Stars: ✭ 802 (+3241.67%)
Mutual labels:  dataset
Rdhs
API Client and Data Munging for the Demographic and Health Survey Data
Stars: ✭ 22 (-8.33%)
Mutual labels:  dataset
Mailinglistscraper
A python web scraper for public email lists.
Stars: ✭ 19 (-20.83%)
Mutual labels:  scraper
Flight Prices Scraper
Automated Script to scrape flight prices from any website into a csv format
Stars: ✭ 17 (-29.17%)
Mutual labels:  scraper
Gifhub
πŸ“ˆ Create GIFs from user's GitHub activity graph
Stars: ✭ 17 (-29.17%)
Mutual labels:  scraper

imagenetscraper

|pypi| |travis|

Command-line utility for downloading all thumbnail images from an ImageNet_ synset, optionally rescaling to a different resolution.

NOTICE: ImageNet downloads are currently offline. This is an upstream issue and out of my control. From the relevant announcement_:

While conducting our study, since January 2019 we have disabled downloads of
the full ImageNet data, except for the small subset of 1,000 categories used
in the ImageNet Challenge. We are in the process of implementing our
proposed remedies. 

Usage

::

Usage: imagenetscraper [OPTIONS] SYNSET_ID [OUTPUT_DIR]

Options:
  -c, --concurrency INTEGER  Number of concurrent downloads (default: 8).
  -s, --size WIDTH,HEIGHT    If specified, images will be rescaled to the
                             given size.
  -q, --quiet                Suppress progress output.
  -h, --help                 Show this message and exit.
  --version                  Show the version and exit.

If the URL of a synset page looks like:

::

http://image-net.org/synset?wnid=n00000000
                                 ^^^^^^^^^
                                 SYNSET_ID

SYNSET_ID is the n00000000 part. For example, for the "person, individual, someone, somebody, mortal, soul" synset at http://image-net.org/synset?wnid=n00007846, the corresponding synset id is n00007846.

The default output directory (OUTPUT_DIR) is the current directory.

Examples


To download all thumbnail imagess from the synset mentioned above to the directory "person_images", run:

::

imagenetscraper n00007846 person_images

To do the same, but with each thumbnail image rescaled to a width of 256 and a height of 128, add --size 256,128:

::

imagenetscraper n00007846 person_images --size 256,128

To run in "quiet mode", suppressing progress output, add --quiet:

::

imagenetscraper n00007846 person_images --size 256,128 --quiet

By default, imagenetscraper will download 8 images at once. To change this, use --concurrency:

::

imagenetscraper n00007846 person_images --size 256,128 --concurrency 4

Install

  1. Install Python 3, pip, and a development version of libjpeg. imagenetscraper is tested with Python 3.4-3.7 and libjpeg-turbo 8.

    ::

    sudo apt-get install python3 python3-pip libjpeg-turbo8-dev

  2. Download and install imagenetscraper with pip.

    ::

    sudo -H pip3 install imagenetscraper

Citation

If this tool helped with your research, a citation would be appreciated:

::

@Misc{imagenetscraper,
author = {Michael Smith},
title = {imagenetscraper: Bulk-download thumbnails from ImageNet synsets},
howpublished = {\url{https://github.com/spinda/imagenetscraper}},
year = {2017}
}

Testing

To run unit tests, use:

::

python3 setup.py test

License

Copyright (C) 2017-2020 Michael Smith [email protected]

This program is free software: you can redistribute it and/or modify it under the terms of the GNU Affero General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Affero General Public License for more details.

You should have received a copy of the GNU Affero General Public License along with this program. If not, see http://www.gnu.org/licenses/.

.. _ImageNet: http://image-net.org/ .. _announcement: http://image-net.org/update-sep-17-2019

.. |pypi| image:: https://img.shields.io/pypi/v/imagenetscraper.svg :alt: PyPI :target: https://pypi.python.org/pypi/imagenetscraper .. |travis| image:: https://img.shields.io/travis/spinda/imagenetscraper/master.svg :alt: Build Status :target: https://travis-ci.org/spinda/imagenetscraper

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].