All Projects → studiomoniker → Quickdraw Appendix

studiomoniker / Quickdraw Appendix

Licence: other
Dataset of 25k penises: an appendix to the Quick, Draw! Dataset

Projects that are alternatives of or similar to Quickdraw Appendix

Gossiping Chinese Corpus
PTT 八卦版問答中文語料
Stars: ✭ 137 (-10.46%)
Mutual labels:  dataset
Clue
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Stars: ✭ 2,425 (+1484.97%)
Mutual labels:  dataset
Opentraj
Human Trajectory Prediction Dataset Benchmark (ACCV 2020)
Stars: ✭ 144 (-5.88%)
Mutual labels:  dataset
Ml Datasets
Machine Learning datasets for Nepal
Stars: ✭ 139 (-9.15%)
Mutual labels:  dataset
Triggerner
TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition (ACL 2020)
Stars: ✭ 141 (-7.84%)
Mutual labels:  dataset
Conversation Tensorflow
TensorFlow implementation of Conversation Models
Stars: ✭ 143 (-6.54%)
Mutual labels:  dataset
Personal Security Checklist
🔒 A curated checklist of 300+ tips for protecting digital security and privacy in 2021
Stars: ✭ 2,388 (+1460.78%)
Mutual labels:  censorship
Music Dance Video Synthesis
(ACM MM 20 Oral) PyTorch implementation of Self-supervised Dance Video Synthesis Conditioned on Music
Stars: ✭ 150 (-1.96%)
Mutual labels:  dataset
Lacmus
Lacmus is a cross-platform application that helps to find people who are lost in the forest using computer vision and neural networks.
Stars: ✭ 142 (-7.19%)
Mutual labels:  dataset
Face Detect
A Python based tool to extract faces from any picture.
Stars: ✭ 146 (-4.58%)
Mutual labels:  dataset
Coffee Quality Database
Building the Coffee Quality Institute Database
Stars: ✭ 141 (-7.84%)
Mutual labels:  dataset
Covid 19 Timeline
请关注端点星案和张展。// 以社会学年鉴模式体例规范地统编自2019年末起武汉新冠肺炎疫情进展的时间线(2019年12月1日-2020年4月24日)。感谢志愿者的辛劳操作。A sociology timeline (2019.12.1-2020.4.24) on how Wuhan Coronavirus break and spread, edited by anonymous volunteers.
Stars: ✭ 142 (-7.19%)
Mutual labels:  censorship
Dstc7 End To End Conversation Modeling
Grounded conversational dataset for end-to-end conversational AI (official DSTC7 data)
Stars: ✭ 141 (-7.84%)
Mutual labels:  dataset
Prosody
Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Stars: ✭ 139 (-9.15%)
Mutual labels:  dataset
Lapa Dataset
A large-scale dataset for face parsing (AAAI2020)
Stars: ✭ 149 (-2.61%)
Mutual labels:  dataset
Dataspice
🌶 Create lightweight schema.org descriptions of your datasets
Stars: ✭ 137 (-10.46%)
Mutual labels:  dataset
Baidutraffic
This repo includes introduction, code and dataset of our paper Deep Sequence Learning with Auxiliary Information for Traffic Prediction (KDD 2018).
Stars: ✭ 143 (-6.54%)
Mutual labels:  dataset
Maskedface Net
MaskedFace-Net is a dataset of human faces with a correctly and incorrectly worn mask based on the dataset Flickr-Faces-HQ (FFHQ).
Stars: ✭ 152 (-0.65%)
Mutual labels:  dataset
Financial News Dataset
Reuters and Bloomberg
Stars: ✭ 147 (-3.92%)
Mutual labels:  dataset
Indonesian Nlp Resources
data resource untuk NLP bahasa indonesia
Stars: ✭ 143 (-6.54%)
Mutual labels:  dataset

The 'Do not draw a penis?' Dataset

grid In 2018 Google open-sourced the Quick, Draw! dataset. “The world's largest doodling dataset”. The set consists of 345 categories and over 15 million drawings. For obvious reasons the dataset was missing a few specific categories that people seem to enjoy drawing. This made us at Moniker think about the moral reality big tech companies are imposing on our global community and that most people willingly accept this. Therefore we decided to publish an appendix to the Google Quickdraw dataset.

So far we have collected 25,000 doodles formatted the same way as Google's dataset. We are happy to announce you can download them here. We have collected the first 10,000 doodles using Amazon's Mechanical Turk, which were drudgingly audited by the staff here at Moniker.

In June of 2019 we released the Do Not Draw a Penis project to collect inappropriate doodles from people who are not willing to stay within the moral guidelines set by our social network providers. It has helped us to collect another 250,000 doodles of which we have marked 15,000 suitable for this appendix.

Dataset's provided

Similar to Google's QuickDraw dataset, we offer the data in the following forms. More information on how to interpret this data can be found here.

Relevant Locations

Technologies

Data collection:

  • Amazon's Mechanical Turk
  • Do not draw

Credits

Concept & development by Moniker Luna Maurer & Roel Wouters

Commissioners

Mozilla, Brett Gaylor HKW, Daniel Neugebauer

Technical Development

Moniker, Tjerk Woudsma, Thomas Boland, Jae Perris

License

This data made available by Moniker under the Creative Commons Attribution 4.0 International license.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].