All Projects β†’ unsplash β†’ Datasets

unsplash / Datasets

🎁 3,000,000+ Unsplash images made available for research and machine learning

Programming Languages

Jupyter Notebook
11667 projects

Projects that are alternatives of or similar to Datasets

Data Science Hacks
Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Stars: ✭ 273 (-84.88%)
Mutual labels:  jupyter-notebook, dataset, data
Unsplash rb
πŸ’Ž Ruby wrapper for the Unsplash API.
Stars: ✭ 202 (-88.81%)
Mutual labels:  unsplash, images, photos
Natural Language Image Search
Search photos on Unsplash using natural language
Stars: ✭ 359 (-80.11%)
Mutual labels:  jupyter-notebook, unsplash, photos
Data Science Resources
πŸ‘¨πŸ½β€πŸ«You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?πŸ”‹
Stars: ✭ 171 (-90.53%)
Mutual labels:  jupyter-notebook, dataset, data
Datascience course
Curso de Data Science em PortuguΓͺs
Stars: ✭ 294 (-83.71%)
Mutual labels:  jupyter-notebook, dataset, data
Unsplash Php
πŸ‘» Official PHP wrapper for the Unsplash API
Stars: ✭ 332 (-81.61%)
Mutual labels:  unsplash, images, photos
Unsplash Js
πŸ€– A server-side JavaScript wrapper for the Unsplash API
Stars: ✭ 1,647 (-8.75%)
Mutual labels:  unsplash, images, photos
Coronawatchnl
Numbers concerning COVID-19 disease cases in The Netherlands by RIVM, LCPS, NICE, ECML, and Rijksoverheid.
Stars: ✭ 135 (-92.52%)
Mutual labels:  jupyter-notebook, dataset
Faceaging By Cyclegan
Stars: ✭ 105 (-94.18%)
Mutual labels:  jupyter-notebook, dataset
Hass Data Detective
Explore and analyse your Home Assistant data
Stars: ✭ 109 (-93.96%)
Mutual labels:  jupyter-notebook, data
Hypertag
Knowledge Management for Humans using Machine Learning & Tags
Stars: ✭ 116 (-93.57%)
Mutual labels:  search-engine, images
Fma
FMA: A Dataset For Music Analysis
Stars: ✭ 1,391 (-22.94%)
Mutual labels:  jupyter-notebook, dataset
Codesearchnet
Datasets, tools, and benchmarks for representation learning of code.
Stars: ✭ 1,378 (-23.66%)
Mutual labels:  jupyter-notebook, data
Imagenetv2
A new test set for ImageNet
Stars: ✭ 109 (-93.96%)
Mutual labels:  jupyter-notebook, dataset
Rasalit
Visualizations and helpers to improve and debug machine learning models for Rasa Open Source
Stars: ✭ 101 (-94.4%)
Mutual labels:  jupyter-notebook, research
Protest Detection Violence Estimation
Implementation of the model used in the paper Protest Activity Detection and Perceived Violence Estimation from Social Media Images (ACM Multimedia 2017)
Stars: ✭ 114 (-93.68%)
Mutual labels:  jupyter-notebook, dataset
Iso 3166 Countries With Regional Codes
ISO 3166-1 country lists merged with their UN Geoscheme regional codes in ready-to-use JSON, XML, CSV data sets
Stars: ✭ 1,372 (-23.99%)
Mutual labels:  dataset, data
Bertqa Attention On Steroids
BertQA - Attention on Steroids
Stars: ✭ 112 (-93.8%)
Mutual labels:  jupyter-notebook, dataset
Know Your Intent
State of the Art results in Intent Classification using Sematic Hashing for three datasets: AskUbuntu, Chatbot and WebApplication.
Stars: ✭ 116 (-93.57%)
Mutual labels:  jupyter-notebook, dataset
Dbg Pds
Deutsche Boerse's Financial Trading Public Data Set
Stars: ✭ 124 (-93.13%)
Mutual labels:  dataset, data

The Unsplash Dataset

The Unsplash Dataset is made up of over 250,000+ contributing global photographers and data sourced from hundreds of millions of searches across a nearly unlimited number of uses and contexts. Due to the breadth of intent and semantics contained within the Unsplash dataset, it enables new opportunities for research and learning.

The Unsplash Dataset is offered in two datasets:

  • the Lite dataset: available for commercial and noncommercial usage, containing 25k nature-themed Unsplash photos, 25k keywords, and 1M searches
  • the Full dataset: available for noncommercial usage, containing 3M+ high-quality Unsplash photos, 5M keywords, and over 250M searches

As the Unsplash library continues to grow, we’ll release updates to the dataset with new fields and new images, with each subsequent release being semantically versioned.

We welcome any feedback regarding the content of the datasets or their format. With your input, we hope to close the gap between the data we provide and the data that you would like to leverage. You can open an issue to report a problem or to let us know what you would like to see in the next release of the datasets.

For more on the Unsplash Dataset, see our announcement and site.

Download

Lite Dataset

The Lite dataset contains all of the same fields as the Full dataset, but is limited to ~25,000 photos. It can be used for both commercial and non-commercial usage, provided you abide by the terms.

⬇️ Download the Lite dataset [~650MB compressed, ~1.4GB raw]

Full Dataset

The Full dataset is available for non-commercial usage and all uses must abide by the terms. To access, please go to unsplash.com/data and request access. The dataset weighs ~20 GB compressed (~43GB raw)).

Documentation

See the documentation for a complete list of tables and fields.

Usage

You can follow these examples to load the dataset in these common formats:

Share your work

We're making this data open and available with the hopes of enabling researchers and developers to discover interesting and useful connections in the data.

We'd love to see what you create, whether that's a research paper, a machine learning model, a blog post, or just an interesting discovery in the data. Send us an email at [email protected].

If you're using the dataset in a research paper, you can attribute the dataset as Unsplash Lite Dataset 1.2.0 or Unsplash Full Dataset 1.2.0 and link to the permalink unsplash.com/data.


The Unsplash Dataset is made available for research purposes. It cannot be used to redistribute the images contained within. To use the Unsplash library in a product, see the Unsplash API.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].