Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → cedrickchee → Capsule Net Pytorch

cedrickchee / Capsule Net Pytorch

Licence: other

[NO MAINTENANCE INTENDED] A PyTorch implementation of CapsNet architecture in the NIPS 2017 paper "Dynamic Routing Between Capsules".

Programming Languages

139335 projects - #7 most used programming language

Labels

pytorch neural-networks capsule-network capsnet

Projects that are alternatives of or similar to Capsule Net Pytorch

Variational Capsule Routing

Official Pytorch code for (AAAI 2020) paper "Capsule Routing via Variational Bayes", https://arxiv.org/pdf/1905.11455.pdf

Stars: ✭ 84 (-46.84%)

Mutual labels: neural-networks, capsule-network, capsnet

Capsnet Traffic Sign Classifier

A Tensorflow implementation of CapsNet(Capsules Net) apply on german traffic sign dataset

Stars: ✭ 166 (+5.06%)

Mutual labels: capsule-network, capsnet

A Keras implementation of CapsNet in NIPS2017 paper "Dynamic Routing Between Capsules". Now test error ＝ 0.34%.

Stars: ✭ 2,428 (+1436.71%)

Mutual labels: capsule-network, capsnet

CapsNet-tensorflow-jupyter

A simple tensorflow implementation of CapsNet (by Dr. G. Hinton), based on my understanding. This repository is built with an aim to simplify the concept, implement and understand it.

Stars: ✭ 16 (-89.87%)

Mutual labels: capsnet, capsule-network

capsules-tensorflow

Another implementation of Hinton's capsule networks in tensorflow.

Stars: ✭ 18 (-88.61%)

Mutual labels: capsnet, capsule-network

Capsnet Tensorflow

A Tensorflow implementation of CapsNet(Capsules Net) in paper Dynamic Routing Between Capsules

Stars: ✭ 3,776 (+2289.87%)

Mutual labels: capsule-network, capsnet

Empirical studies on Capsule Network representation and improvements implemented with PyTorch.

Stars: ✭ 39 (-75.32%)

Mutual labels: capsnet, capsule-network

CapsLayer: An advanced library for capsule theory

Stars: ✭ 351 (+122.15%)

Mutual labels: capsule-network, capsnet

Capsnet Visualization

🎆 A visualization of the CapsNet layers to better understand how it works

Stars: ✭ 371 (+134.81%)

Mutual labels: capsule-network, capsnet

Awesome Capsule Networks

A curated list of awesome resources related to capsule networks

Stars: ✭ 896 (+467.09%)

Mutual labels: neural-networks, capsule-network

Code for my Master thesis on "Capsule Architecture as a Discriminator in Generative Adversarial Networks".

Stars: ✭ 120 (-24.05%)

Mutual labels: capsule-network, capsnet

🛠 All-in-one web-based IDE specialized for machine learning and data science.

Stars: ✭ 2,337 (+1379.11%)

Mutual labels: neural-networks

Uncertainty Metrics

An easy-to-use interface for measuring uncertainty and robustness.

Stars: ✭ 145 (-8.23%)

Mutual labels: neural-networks

This repository contains the code of LiviaNET, a 3D fully convolutional neural network that was employed in our work: "3D fully convolutional networks for subcortical segmentation in MRI: A large-scale study"

Stars: ✭ 143 (-9.49%)

Mutual labels: neural-networks

Enhancenet Code

EnhanceNet: Single Image Super-Resolution Through Automated Texture Synthesis (official repository)

Stars: ✭ 142 (-10.13%)

Mutual labels: neural-networks

Machine Learning inference engine for Microcontrollers and Embedded devices

Stars: ✭ 154 (-2.53%)

Mutual labels: neural-networks

A powerful and intuitive WYSIWYG interface that allows anyone to create Machine Learning models!

Stars: ✭ 1,818 (+1050.63%)

Mutual labels: neural-networks

Lacmus is a cross-platform application that helps to find people who are lost in the forest using computer vision and neural networks.

Stars: ✭ 142 (-10.13%)

Mutual labels: neural-networks

A collection of common algorithms and data structures implemented in java, c++, and python.

Stars: ✭ 142 (-10.13%)

Mutual labels: neural-networks

A PyTorch implementation of CapsNet based on NIPS 2017 paper "Dynamic Routing Between Capsules"

Stars: ✭ 141 (-10.76%)

Mutual labels: capsnet

View All Similar Projects ➔

PyTorch CapsNet: Capsule Network for PyTorch

A CUDA-enabled PyTorch implementation of CapsNet (Capsule Network) based on this paper: Sara Sabour, Nicholas Frosst, Geoffrey E Hinton. Dynamic Routing Between Capsules. NIPS 2017

The current test error is 0.21% and the best test error is 0.20%. The current test accuracy is 99.31% and the best test accuracy is 99.32%.

What is a Capsule

A Capsule is a group of neurons whose activity vector represents the instantiation parameters of a specific type of entity such as an object or object part.

You can learn more about Capsule Networks here.

Why another CapsNet implementation?

I wanted a decent PyTorch implementation of CapsNet and I couldn't find one at the point when I started. The goal of this implementation is focus to help newcomers learn and understand the CapsNet architecture and the idea of Capsules. The implementation is NOT focus on rigorous correctness of the results. In addition, the codes are not optimized for speed. To help us read and understand the codes easier, the codes comes with ample comments and the Python classes and functions are documented with Python docstring.

I will try my best to check and fix issues reported. Contributions are highly welcomed. If you find any bugs or errors in the codes, please do not hesitate to open an issue or a pull request. Thank you.

Status and Latest Updates:

See the CHANGELOG

Datasets

The model was trained on the standard MNIST data.

Note: you don't have to manually download, preprocess, and load the MNIST dataset as TorchVision will take care of this step for you.

I have tried using other datasets. See the Other Datasets section below for more details.

Requirements

Python 3
- Tested with version 3.6.4
PyTorch
- Tested with version 0.3.0.post4
- Migrate existing code to work in version 0.4.0. [Work-In-Progress]
- Code will not run with version 0.1.2 due to keepdim not available in this version.
- Code will not run with version 0.2.0 due to softmax function doesn't takes a dimension.
CUDA 8 and above
- Tested with CUDA 8 and CUDA 9.
TorchVision
tensorboardX
tqdm

Usage

Training and Evaluation

Step 1. Clone this repository with git and install project dependencies.

$ git clone https://github.com/cedrickchee/capsule-net-pytorch.git
$ cd capsule-net-pytorch
$ pip install -r requirements.txt

Step 2. Start the CapsNet on MNIST training and evaluation:

Training with default settings:

$ python main.py

Training on 8 GPUs with 30 epochs and 1 routing iteration:

$ CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 python main.py --epochs 30 --num-routing 1 --threads 16 --batch-size 128 --test-batch-size 128

Step 3. Test a pre-trained model:

If you have trained a model in Step 2 above, then the weights for the trained model will be saved to results/trained_model/model_epoch_10.pth. [WIP] Now just run the following command to get test results.

$ python main.py --is-training 0 --weights results/trained_model/model_epoch_10.pth

Pre-trained Model

You can download the weights for the pre-trained model from my Google Drive. We saved the weights (model state dict) and the optimizer state for the model at the end of every training epoch.

Weights from epoch 50 checkpoint [84 MB].
Weights from epoch 40 to 50 checkpoints [928 MB].

Uncompress and put the weights (.pth files) into ./results/trained_model/.

Note: the model was last trained on 2017-11-26 and the weights last updated on 2017-11-28.

The Default Hyper Parameters

Parameter	Value	CLI arguments
Training epochs	10	--epochs 10
Learning rate	0.01	--lr 0.01
Training batch size	128	--batch-size 128
Testing batch size	128	--test-batch-size 128
Log interval	10	--log-interval 10
Disables CUDA training	false	--no-cuda
Num. of channels produced by the convolution	256	--num-conv-out-channel 256
Num. of input channels to the convolution	1	--num-conv-in-channel 1
Num. of primary unit	8	--num-primary-unit 8
Primary unit size	1152	--primary-unit-size 1152
Num. of digit classes	10	--num-classes 10
Output unit size	16	--output-unit-size 16
Num. routing iteration	3	--num-routing 3
Use reconstruction loss	true	--use-reconstruction-loss
Regularization coefficient for reconstruction loss	0.0005	--regularization-scale 0.0005
Dataset name (mnist, cifar10)	mnist	--dataset mnist
Input image width to the convolution	28	--input-width 28
Input image height to the convolution	28	--input-height 28

Results

Test Error

CapsNet classification test error on MNIST. The MNIST average and standard deviation results are reported from 3 trials.

The results can be reproduced by running the following commands.

 python main.py --epochs 50 --num-routing 1 --use-reconstruction-loss no --regularization-scale 0.0       #CapsNet-v1
 python main.py --epochs 50 --num-routing 1 --use-reconstruction-loss yes --regularization-scale 0.0005   #CapsNet-v2
 python main.py --epochs 50 --num-routing 3 --use-reconstruction-loss no --regularization-scale 0.0       #CapsNet-v3
 python main.py --epochs 50 --num-routing 3 --use-reconstruction-loss yes --regularization-scale 0.0005   #CapsNet-v4

Method	Routing	Reconstruction	MNIST (%)	Paper
Baseline	--	--	--	0.39
CapsNet-v1	1	no	--	0.34 (0.032)
CapsNet-v2	1	yes	--	0.29 (0.011)
CapsNet-v3	3	no	--	0.35 (0.036)
CapsNet-v4	3	yes	0.21	0.25 (0.005)

Training Loss and Accuracy

The training losses and accuracies for CapsNet-v4 (50 epochs, 3 routing iteration, using reconstruction, regularization scale of 0.0005):

Training accuracy. Highest training accuracy: 100%

Training loss. Lowest training error: 0.1938%

Test Loss and Accuracy

The test losses and accuracies for CapsNet-v4 (50 epochs, 3 routing iteration, using reconstruction, regularization scale of 0.0005):

Test accuracy. Highest test accuracy: 99.32%

Test loss. Lowest test error: 0.2002%

Training Speed

Around 5.97s / batch or 8min / epoch on a single Tesla K80 GPU with batch size of 704.
Around 3.25s / batch or 25min / epoch on a single Tesla K80 GPUwith batch size of 128.

In my case, these are the hyperparameters I used for the training setup:

batch size: 128
Epochs: 50
Num. of routing: 3
Use reconstruction loss: yes
Regularization scale for reconstruction loss: 0.0005

Reconstruction

The results of CapsNet-v4.

Digits at left are reconstructed images.

[WIP] Ground truth image from dataset

Model Design

Model architecture:
------------------

Net (
  (conv1): ConvLayer (
    (conv0): Conv2d(1, 256, kernel_size=(9, 9), stride=(1, 1))
    (relu): ReLU (inplace)
  )
  (primary): CapsuleLayer (
    (conv_units): ModuleList (
      (0): Conv2d(256, 32, kernel_size=(9, 9), stride=(2, 2))
      (1): Conv2d(256, 32, kernel_size=(9, 9), stride=(2, 2))
      (2): Conv2d(256, 32, kernel_size=(9, 9), stride=(2, 2))
      (3): Conv2d(256, 32, kernel_size=(9, 9), stride=(2, 2))
      (4): Conv2d(256, 32, kernel_size=(9, 9), stride=(2, 2))
      (5): Conv2d(256, 32, kernel_size=(9, 9), stride=(2, 2))
      (6): Conv2d(256, 32, kernel_size=(9, 9), stride=(2, 2))
      (7): Conv2d(256, 32, kernel_size=(9, 9), stride=(2, 2))
    )
  )
  (digits): CapsuleLayer (
  )
  (decoder): Decoder (
    (fc1): Linear (160 -> 512)
    (fc2): Linear (512 -> 1024)
    (fc3): Linear (1024 -> 784)
    (relu): ReLU (inplace)
    (sigmoid): Sigmoid ()
  )
)

Parameters and size:
-------------------

conv1.conv0.weight: [256, 1, 9, 9]
conv1.conv0.bias: [256]
primary.conv_units.0.weight: [32, 256, 9, 9]
primary.conv_units.0.bias: [32]
primary.conv_units.1.weight: [32, 256, 9, 9]
primary.conv_units.1.bias: [32]
primary.conv_units.2.weight: [32, 256, 9, 9]
primary.conv_units.2.bias: [32]
primary.conv_units.3.weight: [32, 256, 9, 9]
primary.conv_units.3.bias: [32]
primary.conv_units.4.weight: [32, 256, 9, 9]
primary.conv_units.4.bias: [32]
primary.conv_units.5.weight: [32, 256, 9, 9]
primary.conv_units.5.bias: [32]
primary.conv_units.6.weight: [32, 256, 9, 9]
primary.conv_units.6.bias: [32]
primary.conv_units.7.weight: [32, 256, 9, 9]
primary.conv_units.7.bias: [32]
digits.weight: [1, 1152, 10, 16, 8]
decoder.fc1.weight: [512, 160]
decoder.fc1.bias: [512]
decoder.fc2.weight: [1024, 512]
decoder.fc2.bias: [1024]
decoder.fc3.weight: [784, 1024]
decoder.fc3.bias: [784]

Total number of parameters on (with reconstruction network): 8227088 (8 million)

TensorBoard

We logged the training and test losses and accuracies using tensorboardX. TensorBoard helps us visualize how the machine learn over time. We can visualize statistics, such as how the objective function is changing or weights or accuracy varied during training.

TensorBoard operates by reading TensorFlow data (events files).

How to Use TensorBoard

Download a copy of the events files for the latest run from my Google Drive.
Uncompress the file and put it into ./runs.
Check to ensure you have installed tensorflow (CPU version). We need this for TensorBoard server and dashboard.
Start TensorBoard.

$ tensorboard --logdir runs

Open TensorBoard dashboard in your web browser using this URL: http://localhost:6006

Other Datasets

CIFAR10

In the spirit of experiment, I have tried using other datasets. I have updated the implementation so that it supports and works with CIFAR10. Need to note that I have not tested throughly our capsule model on CIFAR10.

Here's how we can train and test the model on CIFAR10 by running the following commands.

python main.py --dataset cifar10 --num-conv-in-channel 3 --input-width 32 --input-height 32 --primary-unit-size 2048 --epochs 80 --num-routing 1 --use-reconstruction-loss yes --regularization-scale 0.0005

Training Loss and Accuracy

The training losses and accuracies for CapsNet-v4 (80 epochs, 3 routing iteration, using reconstruction, regularization scale of 0.0005):

Highest training accuracy: 100%
Lowest training error: 0.3589%

Test Loss and Accuracy

The test losses and accuracies for CapsNet-v4 (80 epochs, 3 routing iteration, using reconstruction, regularization scale of 0.0005):

Highest test accuracy: 71%
Lowest test error: 0.5735%

TODO

[x] Publish results.
[x] More testing.
[ ] Inference mode - command to test a pre-trained model.
[ ] Jupyter Notebook version.
[x] Create a sample to show how we can apply CapsNet to real-world application.
[ ] Experiment with CapsNet:
- [x] Try using another dataset.
- [ ] Come out a more creative model structure.
[x] Pre-trained model and weights.
[x] Add visualization for training and evaluation metrics.
[x] Implement recontruction loss.
[x] Check algorithm for correctness.
[x] Update results from TensorBoard after making improvements and bug fixes.
[x] Publish updated pre-trained model weights.
[x] Log the original and reconstructed images using TensorBoard.
[ ] Update results with reconstructed image and original image.
[ ] Resume training by loading model checkpoint.
[ ] Migrate existing code to work in PyTorch 0.4.0.

WIP is an acronym for Work-In-Progress

Credits

Referenced these implementations mainly for sanity check:

TensorFlow implementation by @naturomics

Learning Resources

Here's some resources that we think will be helpful if you want to learn more about Capsule Networks:

Other Implementations

TensorFlow:
- The first author of the paper, Sara Sabour has released the code.

Real-world Application of CapsNet

The following is a few samples in the wild that show how we can apply CapsNet to real-world use cases.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 158

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (7) 🔗