All Projects → alexeygrigorev → clothing-dataset

alexeygrigorev / clothing-dataset

Licence: CC0-1.0 license
Closing dataset, all classes

Clothing dataset

Over 5,000 images of 20 different classes.

This dataset can be freely used for any purpose, including commercial:

For example:

  • Creating a tutorial or a course (free or paid)
  • Writing a book
  • Kaggle competitions (as an external dataset)
  • Training an internal model at any company

Data

The images.csv file contains:

  • image - the ID of the image (use it to load the image from images/<ID>.jpg)
  • sender_id - the ID of a person who contributed the image
  • label - the class of the image
  • kids - flag, True if it's clothes for kids

Links

Top-10 subset

Images of some classes don’t appear very often. Training a neural network to predict these classes is quite difficult — we need at least 100-200 images of each class to make a meaningful model.

That’s why, for educational purposes, we created a subset of the full dataset that covers only the top-10 classes.

Check it here: https://github.com/alexeygrigorev/clothing-dataset-small

Examples

Do you use this dataset somewhere? Please submit a PR with a link

Acknowledgements

We'd like to thank

  • Kenes Shangereyev and Tagias.com for helping with 3000 images
  • All the 32 people who contributed their images to the dataset via the forms:
  • Everyone who supported the initiative by engaging with the announcements on social media

It wouldn't be possible to collect this dataset without your help!

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].