
dansuh17 / deep-learning-roadmap

Licence: other
my own deep learning mastery roadmap

Projects that are alternatives of or similar to deep-learning-roadmap

Awesome Nas Papers
Awesome Neural Architecture Search Papers
Stars: ✭ 213 (+432.5%)
Mutual labels:  papers, nas, neural-architecture-search
Gif
GIF is a photorealistic generative face model with explicit 3D geometric and photometric control.
Stars: ✭ 233 (+482.5%)
Mutual labels:  generative-adversarial-network, gans
Pytorch Cyclegan And Pix2pix
Image-to-Image Translation in PyTorch
Stars: ✭ 16,477 (+41092.5%)
Mutual labels:  generative-adversarial-network, gans
Hypernets
A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.
Stars: ✭ 221 (+452.5%)
Mutual labels:  nas, neural-architecture-search
Awesome-GAN-Resources
🤖A list of resources to help anyone getting started with GANs 🤖
Stars: ✭ 90 (+125%)
Mutual labels:  generative-adversarial-network, gans
Cocosnet
Cross-domain Correspondence Learning for Exemplar-based Image Translation. (CVPR 2020 Oral)
Stars: ✭ 211 (+427.5%)
Mutual labels:  generative-adversarial-network, gans
Awesome Speech Recognition Speech Synthesis Papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Stars: ✭ 2,085 (+5112.5%)
Mutual labels:  roadmap, papers
A Pytorch Tutorial To Super Resolution
Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network | a PyTorch Tutorial to Super-Resolution
Stars: ✭ 157 (+292.5%)
Mutual labels:  generative-adversarial-network, gans
tools-generation-detection-synthetic-content
Compilation of the state of the art of tools, articles, forums and links of interest to generate and detect any type of synthetic content using deep learning.
Stars: ✭ 107 (+167.5%)
Mutual labels:  papers, gans
Anime2Sketch
A sketch extractor for anime/illustration.
Stars: ✭ 1,623 (+3957.5%)
Mutual labels:  generative-adversarial-network, gans
Paper-Notes
Paper notes in deep learning/machine learning and computer vision
Stars: ✭ 37 (-7.5%)
Mutual labels:  generative-adversarial-network, papers
Colorizing With Gans
Grayscale Image Colorization with Generative Adversarial Networks. https://arxiv.org/abs/1803.05400
Stars: ✭ 209 (+422.5%)
Mutual labels:  generative-adversarial-network, gans
Iseebetter
iSeeBetter: Spatio-Temporal Video Super Resolution using Recurrent-Generative Back-Projection Networks | Python3 | PyTorch | GANs | CNNs | ResNets | RNNs | Published in Springer Journal of Computational Visual Media, September 2020, Tsinghua University Press
Stars: ✭ 202 (+405%)
Mutual labels:  generative-adversarial-network, gans
AGD
[ICML2020] "AutoGAN-Distiller: Searching to Compress Generative Adversarial Networks" by Yonggan Fu, Wuyang Chen, Haotao Wang, Haoran Li, Yingyan Lin, Zhangyang Wang
Stars: ✭ 98 (+145%)
Mutual labels:  generative-adversarial-network, neural-architecture-search
Edge Connect
EdgeConnect: Structure Guided Image Inpainting using Edge Prediction, ICCV 2019 https://arxiv.org/abs/1901.00212
Stars: ✭ 2,163 (+5307.5%)
Mutual labels:  generative-adversarial-network, gans
Finegan
FineGAN: Unsupervised Hierarchical Disentanglement for Fine-grained Object Generation and Discovery
Stars: ✭ 240 (+500%)
Mutual labels:  generative-adversarial-network, gans
CM-NAS
CM-NAS: Cross-Modality Neural Architecture Search for Visible-Infrared Person Re-Identification (ICCV2021)
Stars: ✭ 39 (-2.5%)
Mutual labels:  nas, neural-architecture-search
Gesturegan
[ACM MM 2018 Oral] GestureGAN for Hand Gesture-to-Gesture Translation in the Wild
Stars: ✭ 136 (+240%)
Mutual labels:  generative-adversarial-network, gans
Generative adversarial networks 101
Keras implementations of Generative Adversarial Networks. GANs, DCGAN, CGAN, CCGAN, WGAN and LSGAN models with MNIST and CIFAR-10 datasets.
Stars: ✭ 138 (+245%)
Mutual labels:  generative-adversarial-network, gans
pytorch-gans
PyTorch implementation of GANs (Generative Adversarial Networks). DCGAN, Pix2Pix, CycleGAN, SRGAN
Stars: ✭ 21 (-47.5%)
Mutual labels:  generative-adversarial-network, gans

Deep Learning Roadmap

My own deep learning mastery roadmap, inspired by Deep Learning Papers Reading Roadmap.

There are some customized differences:

  • not only academic papers but also blog posts, online courses, and other references are included
  • customized for my own plans - may not include RL, NLP, etc.
  • updated for 2019 SOTA

Introductory Courses

Basic CNN Architectures

  • AlexNet (2012) [paper]
    • Alex Krizhevsky et al. "ImageNet Classification with Deep Convolutional Neural Networks"
  • ZFNet (2013) [paper]
    • Zeiler et al. "Visualizing and Understanding Convolutional Networks"
  • VGG (2014)
    • Simonyan et al. "Very Deep Convolutional Networks for Large-Scale Image Recognition" (2014) [Google DeepMind & Oxford's Visual Geometry Group (VGG)] [paper]
    • Related: Zhang et al. "Accelerating Very Deep Convolutional Networks for Classification and Detection" (speeding up VGG-style models) [paper]
  • GoogLeNet, a.k.a Inception v.1 (2014) [paper]
    • Szegedy et al. "Going Deeper with Convolutions" [Google]
    • Original LeNet page from Yann LeCun's homepage.
    • Inception v.2 and v.3 (2015) Szegedy et al. "Rethinking the Inception Architecture for Computer Vision" [paper]
    • Inception v.4 and InceptionResNet (2016) Szegedy et al. "Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning" [paper]
    • "A Simple Guide to the Versions of the Inception Network" [blogpost]
  • ResNet (2015) [paper]
    • He et al. "Deep Residual Learning for Image Recognition"
  • Xception (2016) [paper]
    • Chollet, Francois - "Xception: Deep Learning with Depthwise Separable Convolutions"
  • MobileNet (2016) [paper]
    • Howard et al. "MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications"
    • A nice paper about reducing CNN parameter sizes while maintaining performance.
  • DenseNet (2016) [paper]
    • Huang et al. "Densely Connected Convolutional Networks"

Generative adversarial networks

  • GAN (2014.6) [paper]
    • Goodfellow et al. "Generative Adversarial Networks"
  • DCGAN (2015.11) [paper]
    • Radford et al. "Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks"
  • InfoGAN (2016.6) [paper]
    • Chen et al. "InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets"
  • Improved Techniques for Training GANs (2016.6) [paper]
    • Salimans et al. "Improved Techniques for Training GANs"
    • This paper proposes several GAN training techniques such as feature matching, minibatch discrimination, one-sided label smoothing, and virtual batch normalization.
    • It also proposes a well-known generator performance metric called the Inception Score (the formula is given after this list).
  • f-GAN (2016.6) [paper]
    • Nowozin et al. "f-GAN: Training Generative Neural Samplers using Variational Divergence Minimization"
  • Unrolled GAN (2016.7) [paper]
    • Metz et al. "Unrolled Generative Adversarial Networks"
  • ACGAN (2016.10) [paper]
    • Odena et al. "Conditional Image Synthesis With Auxiliary Classifier GANs"
  • LSGAN (2016.11) [paper]
    • Mao et al. "Least Squares Generative Adversarial Networks"
  • Pix2Pix (2016.11) [paper]
    • Isola et al. "Image-to-Image Translation with Conditional Adversarial Networks"
  • EBGAN (2016.11) [paper]
    • Zhao et al. "Energy-based Generative Adversarial Network"
  • WGAN (2017.4) [paper]
    • Arjovsky et al., "Wasserstein GAN"
  • WGAN-GP (2017.5) [paper]
    • Gulrajani et al., "Improved Training of Wasserstein GANs"
    • Improves training stability by adding a "gradient penalty (GP)" term to the critic's loss function (a minimal sketch appears after this list).
  • BEGAN (2017.5) [paper]
    • Berthelot et al. "BEGAN: Boundary Equilibrium Generative Adversarial Networks"
    • Introduces a diversity ratio, an equilibrium hyperparameter that controls the variety-quality tradeoff, and also proposes a convergence measure based on it.
  • CycleGAN (2017.5) [paper]
    • DiscoGAN (2017.5) [paper]
    • DiscoGAN and CycleGAN propose essentially the same cycle-consistency technique for style transfer using GANs, developed independently at around the same time.
  • Fréchet Inception Distance (FID) (2017.6) [paper]
    • Heusel et al. "GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium"
    • The paper's main contribution is the Two Time-Scale Update Rule (TTUR), but it is best known for the Fréchet Inception Distance, a metric that measures the distance between the distributions of Inception activations of real and generated samples (the formula is given after this list).
  • ProGAN (2017.10) [paper]
    • Karras et al. "Progressive Growing of GANs for Improved Quality, Stability, and Variation"
  • PacGAN (2017.12) [paper]
    • Lin et al. "PacGAN: The power of two samples in generative adversarial networks"
  • BigGAN (2018) [paper]
    • Brock et al. "Large Scale GAN Training for High Fidelity Natural Image Synthesis"
  • GauGAN (2019.3) [paper]
    • Park et al. "Semantic Image Synthesis with Spatially-Adaptive Normalization"
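
For quick reference, the two evaluation metrics mentioned in the list above (the Inception Score from the Improved Techniques paper and the Fréchet Inception Distance from the TTUR paper) are commonly written as follows; this is the standard formulation restated here, not text from the roadmap itself. Here p(y|x) is the Inception classifier's label distribution for a generated sample x, p(y) its marginal over generated samples, and (μ_r, Σ_r), (μ_g, Σ_g) are the mean and covariance of Inception activations on real and generated data:

```latex
\mathrm{IS}(G) = \exp\Big( \mathbb{E}_{x \sim p_g} \big[ D_{\mathrm{KL}}\big( p(y \mid x) \,\|\, p(y) \big) \big] \Big)
\qquad
\mathrm{FID} = \lVert \mu_r - \mu_g \rVert_2^2 + \operatorname{Tr}\Big( \Sigma_r + \Sigma_g - 2 \big( \Sigma_r \Sigma_g \big)^{1/2} \Big)
```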
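And for the WGAN-GP entry above, a minimal PyTorch-style sketch of the gradient penalty term; the toy critic, data shapes, and the default λ = 10 below are illustrative assumptions, not something specified in this roadmap:

```python
import torch

def gradient_penalty(critic, real, fake, lambda_gp=10.0):
    """WGAN-GP penalty: push the critic's gradient norm towards 1 on interpolated samples."""
    # Random interpolation coefficients, one per sample, broadcast over the remaining dims.
    alpha = torch.rand(real.size(0), *([1] * (real.dim() - 1)), device=real.device)
    interpolates = (alpha * real + (1 - alpha) * fake).requires_grad_(True)
    scores = critic(interpolates)
    grads = torch.autograd.grad(
        outputs=scores, inputs=interpolates,
        grad_outputs=torch.ones_like(scores),
        create_graph=True, retain_graph=True,
    )[0]
    grad_norm = grads.flatten(start_dim=1).norm(2, dim=1)
    return lambda_gp * ((grad_norm - 1.0) ** 2).mean()

# Toy usage: a small MLP critic on 8-dimensional "samples".
critic = torch.nn.Sequential(torch.nn.Linear(8, 64), torch.nn.ReLU(), torch.nn.Linear(64, 1))
real, fake = torch.randn(16, 8), torch.randn(16, 8)
critic_loss = critic(fake).mean() - critic(real).mean() + gradient_penalty(critic, real, fake)
print(critic_loss.item())
```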

Advanced GANs

  • DRAGAN (2017.5) [paper]
    • Kodali et al. "On Convergence and Stability of GANs"
  • Are GANs Created Equal? (2017.11) [paper]
    • Lucic et al. "Are GANs Created Equal? A Large-Scale Study"
  • SGAN (2017.12) [paper]
    • Chavdarova et al. "SGAN: An Alternative Training of Generative Adversarial Networks"
  • MaskGAN (2018.1) [paper]
    • Fedus et al. "MaskGAN: Better Text Generation via Filling in the _____"
  • Spectral Normalization (2018.2) [paper]
    • Miyato et al. "Spectral Normalization for Generative Adversarial Networks"
  • SAGAN (2018.5) [paper] [tensorflow]
    • Zhang et al. "Self-Attention Generative Adversarial Networks"
  • Unusual Effectiveness of Averaging in GAN Training (2018) [paper]
    • "Benefitting from training on past snapshots."
    • Uses an exponential moving average (EMA) of the generator weights (a minimal sketch appears after this list).
  • Disconnected Manifold Learning (2018.6) [paper]
    • Khayatkhoei, et al. "Disconnected Manifold Learning for Generative Adversarial Networks"
  • A Note on the Inception Score (2018.6) [paper]
    • Barratt et al., "A Note on the Inception Score"
  • Which Training Methods for GANs do actually converge? (2018.7) [paper]
    • Mescheder et al., "Which Training Methods for GANs do actually Converge?"
  • GAN Dissection (2018.11) [paper]
    • Bau et al. "GAN Dissection: Visualizing and Understanding Generative Adversarial Networks"
  • Improving Generalization and Stability for GANs (2019.2) [paper]
    • Thanh-Tung et al., "Improving Generalization and Stability of Generative Adversarial Networks"
  • Augustus Odena - "Open Questions about GANs" (2019.4) [distill.pub]
    • A very nice article about the current state of GAN research that discusses open problems yet to be solved.
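
As a companion to the parameter-averaging entry above, a minimal sketch of keeping an exponential moving average of generator weights during training; the placeholder generator module and decay = 0.999 below are illustrative assumptions, not values taken from the paper:

```python
import copy
import torch

def update_ema(ema_model, model, decay=0.999):
    """ema_param <- decay * ema_param + (1 - decay) * param, called after each generator update."""
    with torch.no_grad():
        for ema_p, p in zip(ema_model.parameters(), model.parameters()):
            ema_p.mul_(decay).add_(p, alpha=1.0 - decay)

generator = torch.nn.Linear(16, 16)          # placeholder for a real generator network
ema_generator = copy.deepcopy(generator)     # sample / evaluate with this averaged copy
# ... inside the training loop, after each optimizer step on the generator:
update_ema(ema_generator, generator)
```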

Autoencoders

  • Original autoencoder (1986) [paper]
    • Rumelhart, Hinton, and Williams, "Learning Internal Representations by Error Propagation"
  • AutoEncoder (2006) [science]
    • Hinton et al., "Reducing the Dimensionality of Data with Neural Networks"
  • Denoising Autoencoders (2008) [paper]
    • Vincent et al. "Extracting and Composing Robust Features with Denoising Autoencoders"
  • Wasserstein Autoencoder (2017) [paper]
    • Tolstikhin et al. "Wasserstein Auto-Encoders"

Autoregressive models

  • PixelCNN (2016) [paper]
    • van den Oord et al. "Conditional image generation with PixelCNN decoders."
  • WaveNet (2016) [paper]
    • van den Oord et al. "WaveNet: A Generative Model for Raw Audio"
  • tacotron?

Layer Normalizations

  • Batch Normalization (2015.2) [paper]
    • Ioffe et al. "Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift"
  • Group Normalization (2018.3)
    • Wu et al. "Group Normalization"
  • Instance Normalization (2016.7) [paper]
    • Ulyanov et al. "Instance Normalization: The Missing Ingredient for Fast Stylization"
  • Santurkar et al. "How does Batch Normalization help Optimization?" (2018.5) [paper]
  • Switchable Normalization (2019) [paper]
    • Luo et al. "Differentiable Learning-to-Normalize via Switchable Normalization"
  • Weight Standardization (2019.3) [paper]
    • Qiao et al. "Weight Standardization"

Initializations

  • Xavier Initialization (2010) [paper]
    • Glorot et al., "Understanding the difficulty of training deep feedforward neural networks"
  • Kaiming (He) Initialization (2015.2) [paper]
    • He et al., "Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification"
  • All you need is a good init (2015.11) [paper]
    • Mishkin et al., "All you need is a good init"
  • All you need is beyond a good init (2017.4) [paper]
    • Xie et al. "All You Need is Beyond a Good Init: Exploring Better Solution for Training Extremely Deep Convolutional Neural Networks with Orthonormality and Modulation"

Dropouts

  • Dropout (2014) [paper]
    • Srivastava et al. "Dropout: A Simple Way to Prevent Neural Networks from Overfitting"
  • Inverted Dropouts [notes on CS231n]
    • Multiplies by the inverse of keep_prob at training time so that activation scales at inference (test) time stay consistent (a small numpy sketch follows this list).
  • Li et al., "Understanding the Disharmony between Dropout and Batch Normalization by Variance Shift" (2018.1) [paper]
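
A small numpy sketch of inverted dropout as described in the CS231n notes above; the layer activations x and keep_prob = 0.8 are placeholder values:

```python
import numpy as np

keep_prob = 0.8
x = np.random.randn(4, 10)  # activations of some layer (placeholder data)

# Training time: drop units and scale the survivors by 1 / keep_prob.
mask = (np.random.rand(*x.shape) < keep_prob) / keep_prob
train_out = x * mask

# Test time: use the activations unchanged; their expected scale already matches training.
test_out = x
```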

Meta-Learning / Representation Learning (Zero-Shot learning, Few-Shot learning)

  • Zero-Data Learning (2008) [paper]
    • Larochelle et al., "Zero-data Learning of New Tasks"
  • Palatucci et al., "Zero-shot Learning with Semantic Output Codes" (NIPS 2009) [paper]
  • Socher et al., "Zero-Shot Learning Through Cross-Modal Transfer" (2013.1) [paper]
  • Lampert et al., "Attribute-Based Classification for Zero-Shot Visual Object Categorization" (2013.7) [paper]
  • Dinu et al., "Improving zero-shot learning by mitigating the hubness problem" (2014.12) [paper]
  • Romera-Paredes et al. - "An embarrassingly simple approach to zero-shot learning" (2015) [paper]
  • Prototypical Networks (2017.3) [paper]
    • Snell et al., "Prototypical Networks for Few-shot Learning"
  • Zero-shot learning - the Good, the Bad and the Ugly (2017.3) [paper]
    • Xian et al., "Zero-Shot Learning - The Good, the Bad and the Ugly"
  • In Defense of the Triplet Loss (2017.3) [paper]
    • Hermans et al., "In Defense of the Triplet Loss for Person Re-Identification"
  • MAML (2017.3) [paper]
    • Finn et al, "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
  • Triplet Loss and Online Triplet Mining in TensorFlow (2018.3) [Olivier Moindrot Blog]
  • Few-Shot learning Survey (2019.4) [paper]
    • Wang et al. "Few-shot Learning: A Survey"

Transfer learning

  • Survey 2018 (2018) [paper]
    • Tan et al. "A Survey on Deep Transfer Learning"

Geometric learning

  • Geometric Deep Learning (2016) [paper]
    • Bronstein et al. "Geometric deep learning: going beyond Euclidean data"

Variational Autoencoders (VAE)

  • VQ-VAE (2017.11) [paper]
    • van den Oord et al., "Neural Discrete Representation Learning"
  • Semi-Amortized Variational Autoencoders (2018.2) [paper]
    • Kim et al. "Semi-Amortized Variational Autoencoders"

Object detection

Semantic Segmentation

Sequential Model

  • Seq2Seq (2014) [paper]
    • Sutskever et al. "Sequence to sequence learning with neural networks."

Neural Turing Machine

  • Neural Turing Machines (2014) [paper]
    • Graves et al., "Neural turing machines."
  • Pointer Networks (2015) [paper]
    • Vinyals et al., "Pointer networks."

Attention / Question-Answering

  • NMT (Neural Machine Translation) (2014) [paper]
    • Bahdanau et al, "Neural Machine Translation by Jointly Learning to Align and Translate"
  • Stanford Attentive Reader (2016.6) [paper]
    • Chen et al. "A Thorough Examination of the CNN/Daily Mail Reading Comprehension Task"
  • BiDAF (2016.11) [paper]
    • Seo et al. "Bidirectional Attention Flow for Machine Comprehension"
  • DrQA or Stanford Attentive Reader++ (2017.3) [paper]
    • Chen et al. "Reading Wikipedia to Answer Open-Domain Questions"
  • Transformer (2017.8) [paper] [google ai blog]
    • Vaswani et al. "Attention is all you need"
  • [read] Lilian Weng - "Attention? Attention!" (2018) [blog_post]
    • A nice explanation of the attention mechanism and its related concepts.
  • BERT (2018.10) [paper]
    • Devlin et al., "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
  • GPT-2 (2019) [paper (pdf)]
    • Radford et al. "Language Models are Unsupervised Multitask Learners"

Advanced RNNs

Model Compression

  • MobileNet (2016) (see above: Basic CNN Architectures)
  • ShuffleNet (2017)
    • Zhang et al. "ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices"

Neural Processes

  • Neural Processes (2018) [paper]
    • Garnelo et al. "Neural Processes"
  • Attentive Neural Processes (2019) [paper]
    • Kim et al. "Attentive Neural Processes"
  • A Visual Exploration of Gaussian Processes (2019) [Distill.pub]
    • Not a neural process paper, but gives very nice intuition about Gaussian processes. A good read.

Self-supervised learning

Data Augmentation

  • Shake Shake Regularization (2017.5) [paper]
    • Gastaldi, Xavier - "Shake-Shake Regularization"

Interpretation and Theory on Generalization, Overfitting, and Learning Capacity

  • MDL (Minimum Description Length)
    • Peter Grunwald - "A tutorial introduction to the minimum description length principle" (2004) [paper]
  • Grunwald et al., - "Shannon Information and Kolmogorov Complexity" (2010) [paper]
  • Dauphin et al. "Identifying and attacking the saddle point problem in high-dimensional non-convex optimization" (2014.6) [paper]
  • Choromanska et al. "The Loss Surfaces of Multilayer Networks" (2014.11) [paper]
    • Argues that non-convexity in neural networks is not a huge problem.
  • Knowledge Distillation (2015.3) [paper]
    • Hinton et al., "Distilling the Knowledge in a Neural Network"
  • 3-Part Learning Theory by Mostafa Samir
  • Deconvolution and Checkerboard Artifacts - Odena (2016) [distill.pub article]
  • Keskar et al. "On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima" (2016.9) [paper]
  • Rethinking Generalization (2016.11) [paper]
    • Zhang et al. "Understanding deep learning requires rethinking generalization"
  • Information Bottleneck (2017) [paper] [original paper on information bottleneck (2000)] [youtube-talk] [article in quantamagazine]
    • Shwartz-Ziv and Tishby, "Opening the Black Box of Deep Neural Networks via Information"
  • Neyshabur et al, "Exploring Generalization in Deep Learning" (2017.7) [paper]
  • Sun et al., "Revisiting Unreasonable Effectiveness of Data in Deep Learning Era" (2017.7) [paper]
  • Super-Convergence (2017.8) [paper]
    • Smith et al. - "Super-Convergence: Very Fast Training of Neural Networks Using Large Learning Rates"
  • Don't Decay the Learning Rate, Increase the Batch Size (2017.11) [paper]
    • Smith et al. "Don't Decay the Learning Rate, Increase the Batch Size"
  • Hestness et al. "Deep Learning Scaling is Predictable, Empirically" (2017.12) [paper]
  • Visualizing loss landscape of neural nets (2018) [paper]
  • Olson et al., "Modern Neural Networks Generalize on Small Data Sets" (NeurIPS 2018) [paper]
  • Lottery Ticket Hypothesis (2018.3) [paper]
    • Frankle et al., "The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks"
    • Empirically shows that pruning small-magnitude weights after training, rewinding the surviving weights to their initial values, and then re-training the pruned network can give even better results (a toy sketch follows this list).
  • Intrinsic Dimension (2018.4) [paper]
    • Li et al., "Measuring the Intrinsic Dimension of Objective Landscapes"
  • Geirhos et al. "ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness" (2018.11) [paper]
  • Belkin et al. "Reconciling modern machine learning and the bias-variance trade-off" (2018.12) [paper]
  • Graetz - "How to visualize convolution features in 40 lines of code" (2019) [medium]
  • Geiger et al. "Scaling description of generalization with number of parameters in deep learning" (2019.1) [paper]
  • Are all layers created equal? (2019.2) [paper]
    • Zhang et al. "Are all layers created equal?"
  • Lilian Weng - "Are Deep Neural Networks Dramatically Overfitted?" (2019.4) [lil'log]
    • Excellent article about generalization and overfitting of deep neural networks
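
To make the lottery-ticket procedure above concrete, here is a toy numpy sketch of a single prune-rewind-retrain round on a linear least-squares model; the synthetic data, 50% pruning ratio, and plain gradient-descent trainer are illustrative assumptions and not the paper's actual setup:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(256, 20))
true_w = np.where(rng.random(20) < 0.3, rng.normal(size=20), 0.0)  # sparse ground truth
y = X @ true_w + 0.01 * rng.normal(size=256)

w_init = rng.normal(scale=0.1, size=20)  # remember the original initialization

def train(w, mask, steps=500, lr=0.05):
    """Plain gradient descent on MSE, keeping pruned weights at zero."""
    w = w * mask
    for _ in range(steps):
        grad = X.T @ (X @ w - y) / len(y)
        w = (w - lr * grad) * mask
    return w

# 1) Train the dense model.
dense_mask = np.ones(20)
w_trained = train(w_init, dense_mask)

# 2) Prune the smallest-magnitude 50% of weights.
threshold = np.quantile(np.abs(w_trained), 0.5)
mask = (np.abs(w_trained) > threshold).astype(float)

# 3) Rewind the surviving weights to their initial values and retrain the pruned network.
w_ticket = train(w_init, mask)

print("dense MSE :", np.mean((X @ w_trained - y) ** 2))
print("ticket MSE:", np.mean((X @ w_ticket - y) ** 2))
```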

Adversarial Attacks and Defense against attacks (RobustML)

  • RobustML site
  • Adversarial Examples: Szegedy et al. - Intriguing Properties of Neural Networks (2013.12) [paper]
    • Induces misclassification by applying small perturbations to the input.
    • This paper coined the term "adversarial example".
  • Fast Gradient Sign Method (FGSM) (2014.12)
    • Goodfellow et al., "Explaining and Harnessing Adversarial Examples" (ICLR 2015) [paper]
    • This paper presented the famous "panda" example (also used in the PyTorch adversarial-example tutorial); the one-step perturbation is written out after this list.
  • Kurakin et al., "Adversarial Machine Learning at Scale" (2016.11) [paper]
  • Madry et al., "Towards Deep Learning Models Resistant to Adversarial Attacks" (2017.6) [paper]
  • Carlini et al., "Audio Adversarial Examples: Targeted Attacks on Speech-to-Text" (2018.1) [paper]
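
For reference, the one-step FGSM perturbation from the Goodfellow et al. paper above takes a single step of size ε in the direction of the sign of the input gradient of the loss J:

```latex
x_{\mathrm{adv}} = x + \epsilon \cdot \operatorname{sign}\big( \nabla_x J(\theta, x, y) \big)
```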

Neural architecture search (NAS) and AutoML

  • GREAT AutoML Website [site]
    • They maintain a blog, a list of NAS literature, an analysis page, and a web book.
  • AdaNet (2016.7) [paper] [GoogleAI blog]
    • Cortes et al. "AdaNet: Adaptive Structural Learning of Artificial Neural Networks"
  • NAS (2016.12) [paper]
    • Zoph et al. "Neural Architecture Search with Reinforcement Learning"
  • PNAS (2017.12) [paper]
    • Liu et al. "Progressive Neural Architecture Search"
  • ENAS (2018.2) [paper]
    • Pham et al. "Efficient Neural Architecture Search via Parameter Sharing"
  • DARTS (2018.6) [paper]
    • Liu et al. "DARTS: Differentiable Architecture Search"
    • Uses a continuous relaxation over the discrete neural architecture space (the relaxation is written out after this list).
  • RandWire (2019) [paper]
    • Xie et al. "Exploring Randomly Wired Neural Networks for Image Recognition" [Facebook AI Research]
  • A Survey on Neural Architecture Search (2019) [paper]
    • Wistuba et al., "A Survey on Neural Architecture Search"
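
For reference, DARTS's continuous relaxation mentioned above replaces the hard choice of an operation on each edge (i, j) with a softmax-weighted mixture over the candidate operation set O, so the architecture parameters α can be optimized by gradient descent:

```latex
\bar{o}^{(i,j)}(x) = \sum_{o \in \mathcal{O}} \frac{\exp\big(\alpha^{(i,j)}_{o}\big)}{\sum_{o' \in \mathcal{O}} \exp\big(\alpha^{(i,j)}_{o'}\big)} \; o(x)
```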

Practical Techniques

DL roadmap reference

Theory

Resources

  • A Selective Overview of Deep Learning (2019) [paper]
    • Fan et al. "A Selective Overview of Deep Learning"
    • A nice overview paper on deep learning up to early 2019 (about 30 pages)