All Projects → nperraud → Download Celeba Hq

nperraud / Download Celeba Hq

Python script to download the celebA-HQ dataset from google drive

Projects that are alternatives of or similar to Download Celeba Hq

Nb2xls
Convert Jupyter notebook to Excel spreadsheet
Stars: ✭ 129 (-0.77%)
Mutual labels:  jupyter-notebook
Accelerated dl pytorch
Accelerated Deep Learning with PyTorch at Jupyter Day Atlanta II
Stars: ✭ 129 (-0.77%)
Mutual labels:  jupyter-notebook
Mml Companion
This is a companion to the ‘Mathematical Foundations’ section of the book, Mathematics for Machine Learning by Marc Deisenroth, Aldo Faisal and Cheng Ong, written in python for Jupyter Notebook.
Stars: ✭ 130 (+0%)
Mutual labels:  jupyter-notebook
Stanford Machine Learning Camp
Stars: ✭ 128 (-1.54%)
Mutual labels:  jupyter-notebook
Pytorch Book
Source codes for the book "Application of Neural Network and PyTorch"
Stars: ✭ 129 (-0.77%)
Mutual labels:  jupyter-notebook
Numba tutorial scipy2016
Numba tutorial materials for Scipy 2016
Stars: ✭ 129 (-0.77%)
Mutual labels:  jupyter-notebook
Pytorch Ensembles
Pitfalls of In-Domain Uncertainty Estimation and Ensembling in Deep Learning, ICLR 2020
Stars: ✭ 128 (-1.54%)
Mutual labels:  jupyter-notebook
Regularized Linear Autoencoders
Loss Landscapes of Regularized Linear Autoencoders
Stars: ✭ 130 (+0%)
Mutual labels:  jupyter-notebook
Reptile Pytorch
A PyTorch implementation of OpenAI's REPTILE algorithm
Stars: ✭ 129 (-0.77%)
Mutual labels:  jupyter-notebook
Mltutorial
Machine Learning Tutorial in IPython Notebooks
Stars: ✭ 129 (-0.77%)
Mutual labels:  jupyter-notebook
My deep project
个人深度学习项目整理
Stars: ✭ 129 (-0.77%)
Mutual labels:  jupyter-notebook
Dask Tutorial Pycon 2018
Stars: ✭ 129 (-0.77%)
Mutual labels:  jupyter-notebook
Siamese net
This package shows how to train a siamese network using Lasagne and Theano and includes network definitions for state-of-the-art networks including: DeepID, DeepID2, Chopra et. al, and Hani et. al. We also include one pre-trained model using a custom convolutional network.
Stars: ✭ 129 (-0.77%)
Mutual labels:  jupyter-notebook
Eewpython
A series of Jupyter notebook to learn Google Earth Engine with Python
Stars: ✭ 129 (-0.77%)
Mutual labels:  jupyter-notebook
Citylearn
Official reinforcement learning environment for demand response and load shaping
Stars: ✭ 129 (-0.77%)
Mutual labels:  jupyter-notebook
Real Time Sentiment Tracking On Twitter For Brand Improvement And Trend Recognition
A real-time interactive web app based on data pipelines using streaming Twitter data, automated sentiment analysis, and MySQL&PostgreSQL database (Deployed on Heroku)
Stars: ✭ 127 (-2.31%)
Mutual labels:  jupyter-notebook
Micrograd
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
Stars: ✭ 1,848 (+1321.54%)
Mutual labels:  jupyter-notebook
Slides Scipyconf 2018
A repository for public storage of slides given at the 17th Python in Science Conferences (2018)
Stars: ✭ 130 (+0%)
Mutual labels:  jupyter-notebook
Object detection demo
How to train an object detection model easy for free
Stars: ✭ 130 (+0%)
Mutual labels:  jupyter-notebook
Hands on julia
Stars: ✭ 129 (-0.77%)
Mutual labels:  jupyter-notebook

download-celebA-HQ

Python script to download and create the celebA-HQ dataset.

WARNING from the author. I believe this script is broken since a few months (I have not try it for a while). I am really sorry about that. If you fix it, please share you solution in a PR so that everyone can benefit from it.

To get the celebA-HQ dataset, you need to a) download the celebA dataset download_celebA.py , b) download some extra files download_celebA_HQ.py, c) do some processing to get the HQ images make_HQ_images.py.

The size of the final dataset is 89G. However, you will need a bit more storage to be able to run the scripts.

Usage

  1. Clone the repository
git clone https://github.com/nperraud/download-celebA-HQ.git
cd download-celebA-HQ
  1. Install necessary packages (Because specific versions are required Conda is recomended)
conda create -n celebaHQ python=3
source activate celebaHQ
  • Install the packages
conda install jpeg=8d tqdm requests pillow==3.1.1 urllib3 numpy cryptography scipy
pip install opencv-python==3.4.0.12 cryptography==2.1.4
  • Install 7zip (On Ubuntu)
sudo apt-get install p7zip-full
  1. Run the scripts
python download_celebA.py ./
python download_celebA_HQ.py ./
python make_HQ_images.py ./

where ./ is the directory where you wish the data to be saved.

  1. Go watch a movie, theses scripts will take a few hours to run depending on your internet connection and your CPU power. The final HQ images will be saved as .npy files in the ./celebA-HQ folder.

Windows

The script may work on windows, though I have not tested this solution personnaly

Step 2 becomes

conda create -n celebaHQ python=3
source activate celebaHQ
  • Install the packages
conda  install -c anaconda jpeg=8d tqdm requests pillow==3.1.1 urllib3 numpy cryptography scipy
  • Install 7zip

The rest should be unchanged.

Docker

If you have Docker installed, skip the previous installation steps and run the following command from the root directory of this project:

docker build -t celeba . && docker run -it -v $(pwd):/data celeba

By default, this will create the dataset in same directory. To put it elsewhere, replace $(pwd) with the absolute path to the desired output directory.

Outliers

It seems that the dataset has a few outliers. A of problematic images is stored in bad_images.txt. Please report if you find other outliers.

Remark

This script is likely to break somewhere, but if it executes until the end, you should obtain the correct dataset.

Sources

This code is inspired by these files

Citing the dataset

You probably want to cite the paper "Progressive Growing of GANs for Improved Quality, Stability, and Variation" that was submitted to ICLR 2018 by Tero Karras (NVIDIA), Timo Aila (NVIDIA), Samuli Laine (NVIDIA), Jaakko Lehtinen (NVIDIA and Aalto University).

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].