All Projects → DrSleep → Tensorflow Deeplab Lfov

DrSleep / Tensorflow Deeplab Lfov

Licence: mit
DeepLab-LargeFOV implemented in tensorflow

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Tensorflow Deeplab Lfov

Pytorch Deeplab
DeepLab-ResNet rebuilt in Pytorch
Stars: ✭ 254 (+16.51%)
Mutual labels:  semantic-segmentation, pascal-voc
Tensorflow Deeplab V3
DeepLabv3 built in TensorFlow
Stars: ✭ 278 (+27.52%)
Mutual labels:  semantic-segmentation, pascal-voc
DST-CBC
Implementation of our paper "DMT: Dynamic Mutual Training for Semi-Supervised Learning"
Stars: ✭ 98 (-55.05%)
Mutual labels:  semantic-segmentation, pascal-voc
FCN-Segmentation-TensorFlow
FCN for Semantic Image Segmentation achieving 68.5 mIoU on PASCAL VOC
Stars: ✭ 34 (-84.4%)
Mutual labels:  semantic-segmentation, pascal-voc
Fcn Googlenet
GoogLeNet implementation of Fully Convolutional Networks for Semantic Segmentation in TensorFlow
Stars: ✭ 45 (-79.36%)
Mutual labels:  semantic-segmentation, pascal-voc
Edgenets
This repository contains the source code of our work on designing efficient CNNs for computer vision
Stars: ✭ 331 (+51.83%)
Mutual labels:  semantic-segmentation, pascal-voc
Deeplab V2 Resnet 101 Tensorflow
An (re-)implementation of DeepLab v2 (ResNet-101) in TensorFlow for semantic image segmentation on the PASCAL VOC 2012 dataset.
Stars: ✭ 173 (-20.64%)
Mutual labels:  semantic-segmentation, pascal-voc
Chainer Pspnet
PSPNet in Chainer
Stars: ✭ 76 (-65.14%)
Mutual labels:  semantic-segmentation, pascal-voc
Pytorch Auto Drive
Segmentation models (ERFNet, ENet, DeepLab, FCN...) and Lane detection models (SCNN, SAD, PRNet, RESA, LSTR...) based on PyTorch 1.6 with mixed precision training
Stars: ✭ 32 (-85.32%)
Mutual labels:  semantic-segmentation, pascal-voc
Pytorch Deeplab Resnet
DeepLab resnet v2 model in pytorch
Stars: ✭ 584 (+167.89%)
Mutual labels:  semantic-segmentation, pascal-voc
Tensorflow Deeplab Resnet
DeepLab-ResNet rebuilt in TensorFlow
Stars: ✭ 1,198 (+449.54%)
Mutual labels:  semantic-segmentation, pascal-voc
Dsrg
Weakly-Supervised Semantic Segmentation Network with Deep Seeded Region Growing (CVPR 2018).
Stars: ✭ 206 (-5.5%)
Mutual labels:  semantic-segmentation, pascal-voc
Eye In The Sky
Satellite Image Classification using semantic segmentation methods in deep learning
Stars: ✭ 185 (-15.14%)
Mutual labels:  semantic-segmentation
Semantic Segmentation Suite
Semantic Segmentation Suite in TensorFlow. Implement, train, and test new Semantic Segmentation models easily!
Stars: ✭ 2,395 (+998.62%)
Mutual labels:  semantic-segmentation
Fcn Semantic Segmentation
Fully convolutional networks for semantic segmentation
Stars: ✭ 181 (-16.97%)
Mutual labels:  semantic-segmentation
Torchdistill
PyTorch-based modular, configuration-driven framework for knowledge distillation. 🏆18 methods including SOTA are implemented so far. 🎁 Trained models, training logs and configurations are available for ensuring the reproducibiliy.
Stars: ✭ 177 (-18.81%)
Mutual labels:  semantic-segmentation
Intrada
Unsupervised Intra-domain Adaptation for Semantic Segmentation through Self-Supervision (CVPR 2020 Oral)
Stars: ✭ 211 (-3.21%)
Mutual labels:  semantic-segmentation
Fast Semantic Segmentation
ICNet and PSPNet-50 in Tensorflow for real-time semantic segmentation
Stars: ✭ 203 (-6.88%)
Mutual labels:  semantic-segmentation
Imgclsmob
Sandbox for training deep learning networks
Stars: ✭ 2,405 (+1003.21%)
Mutual labels:  semantic-segmentation
Cct
[CVPR 2020] Semi-Supervised Semantic Segmentation with Cross-Consistency Training.
Stars: ✭ 171 (-21.56%)
Mutual labels:  semantic-segmentation

DeepLab-TensorFlow

This is an implementation of DeepLab-LargeFOV in TensorFlow for semantic image segmentation on PASCAL VOC dataset.

Model Description

The DeepLab-LargeFOV is built on a fully convolutional variant of the VGG-16 net with several modifications: first, it exploits atrous (dilated) convolutions to increase the field-of-view; second, the number of filters in the last layers is reduced from 4096 to 1024 in order to decrease the memory consumption and the time spent on performing one forward-backward pass; third, it omits the last pooling layers to keep the downsampling ratio of 8.

The model is trained on a mini-batch of images and corresponding ground truth masks with the softmax classifier on the top. During training, the masks are downsampled to match the size of the output from the network; during inference, to acquire the output of the same size as the input, bilinear upsampling is applied. The final segmentation mask is acquired using argmax over unnormalised log scores from the network. Optionally, a fully-connected probabilistic graphical model, namely, CRF, can be applied to refine the final predictions. On the test set of PASCAL VOC, the model shows 70.3% of mean intersection-over-union.

For more details on the underlying model please refer to the following paper:

@article{CP2016Deeplab,
  title={DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs},
  author={Liang-Chieh Chen and George Papandreou and Iasonas Kokkinos and Kevin Murphy and Alan L Yuille},
  journal={arXiv:1606.00915},
  year={2016}
}

Requirements

TensorFlow needs to be installed before running the scripts. TensorFlow>=0.11 is supported.

To install the required python packages (except TensorFlow), run

pip install -r requirements.txt

or for a local installation

pip install -user -r requirements.txt

Caffe to TensorFlow conversion

To imitate the structure of the model, we have used .caffemodel files provided by the authors. The .util/extract_params.py script saves the structure of the network, i.e. the name of the parameters with their corresponding shapes (in TF 'HNWC' format), as well as the weights of those parameters (again, in the TF format). These weights can be used to initialise the variables in the model; otherwise, the filters will be initialised using the Xavier initialisation scheme, and biases will be initiliased as 0s. To use this script you will need to install Caffe. It is optional, and you can download two already converted models (model.ckpt-init and model.ckpt-pretrained) here.

Dataset

To train the network, we use the augmented PASCAL VOC 2012 dataset with 10582 images for training and 1449 images for validation.

Training

We initialised the network from the .caffemodel file provided by the authors. In that model, the last classification layer is randomly initialised using the Xavier scheme with biases set to zeros. The loss function is the pixel-wise softmax loss, and it is optimised using Adam. No weight decay is used.

The train.py script provides an ability to monitor model performance by snapshotting current results: Besides that, one can change the input size and augment data with random scaling.

To see the documentation on each of the training settings run the following:

python train.py --help

Evaluation

After the training, the model shows 57% mIoU on the Pascal VOC 2012 validation dataset. The model initialised from the pre-trained .caffemodel shows 67% mIoU on the same dataset. Note that in the original DeepLab each image is padded so that the input is of size 513x513 and CRF is used, which can be one of the reason of the lower score (~70.3% mIoU).

To see the documentation on each of the evaluation settings run the following:

python evaluate.py --help

Inference

To perform inference over your own images, use the following command:

python inference.py /path/to/your/image /path/to/ckpt/file

This will run the forward pass and save the resulted mask with this colour map:

Missing features

At the moment, the post-processing step with CRF is not implemented. Besides that, the weight decay is missing, as well.

Other implementations

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].