All Projects → IBM → Biglittlenet

IBM / Biglittlenet

Licence: apache-2.0
Official repository for Big-Little Net

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Biglittlenet

Pneumonia Detection From Chest X Ray Images With Deep Learning
Detecting Pneumonia in Chest X-ray Images using Convolutional Neural Network and Pretrained Models
Stars: ✭ 64 (+12.28%)
Mutual labels:  imagenet, image-recognition
Papers
A list of paper, books and sites for various different topics related to machine learning and deep learning along with various field in which it is implemented
Stars: ✭ 63 (+10.53%)
Mutual labels:  speech-recognition, image-recognition
Iresnet
Improved Residual Networks (https://arxiv.org/pdf/2004.04989.pdf)
Stars: ✭ 163 (+185.96%)
Mutual labels:  imagenet, image-recognition
BottleneckTransformers
Bottleneck Transformers for Visual Recognition
Stars: ✭ 231 (+305.26%)
Mutual labels:  imagenet, image-recognition
Pyconv
Pyramidal Convolution: Rethinking Convolutional Neural Networks for Visual Recognition (https://arxiv.org/pdf/2006.11538.pdf)
Stars: ✭ 231 (+305.26%)
Mutual labels:  imagenet, image-recognition
Kur
Descriptive Deep Learning
Stars: ✭ 811 (+1322.81%)
Mutual labels:  speech-recognition, image-recognition
Artyom.js
A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
Stars: ✭ 1,011 (+1673.68%)
Mutual labels:  speech-recognition
Segmentationcpp
A c++ trainable semantic segmentation library based on libtorch (pytorch c++). Backbone: ResNet, ResNext. Architecture: FPN, U-Net, PAN, LinkNet, PSPNet, DeepLab-V3, DeepLab-V3+ by now.
Stars: ✭ 49 (-14.04%)
Mutual labels:  imagenet
Pncc
A implementation of Power Normalized Cepstral Coefficients: PNCC
Stars: ✭ 40 (-29.82%)
Mutual labels:  speech-recognition
Voice
🎤 React Native Voice Recognition library for iOS and Android (Online and Offline Support)
Stars: ✭ 993 (+1642.11%)
Mutual labels:  speech-recognition
Divide And Co Training
[Paper 2020] Towards Better Accuracy-efficiency Trade-offs: Divide and Co-training. Plus, an image classification toolbox includes ResNet, Wide-ResNet, ResNeXt, ResNeSt, ResNeXSt, SENet, Shake-Shake, DenseNet, PyramidNet, and EfficientNet.
Stars: ✭ 54 (-5.26%)
Mutual labels:  imagenet
Estimated Depth Map Helps Image Classification
Depth estimation with neural network, and learning on RGBD images
Stars: ✭ 52 (-8.77%)
Mutual labels:  image-recognition
Textnormalizationcoveringgrammars
Covering grammars for English and Russian text normalization
Stars: ✭ 46 (-19.3%)
Mutual labels:  speech-recognition
Avsr Deep Speech
Google Summer of Code 2017 Project: Development of Speech Recognition Module for Red Hen Lab
Stars: ✭ 43 (-24.56%)
Mutual labels:  speech-recognition
Pretrained Models.pytorch
Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.
Stars: ✭ 8,318 (+14492.98%)
Mutual labels:  imagenet
Imagenet resnet tensorflow2.0
Train ResNet on ImageNet in Tensorflow 2.0; ResNet 在ImageNet上完整训练代码
Stars: ✭ 42 (-26.32%)
Mutual labels:  imagenet
Iflytek awaken asr
use iflytek's technology to realize awaken and order recognition
Stars: ✭ 53 (-7.02%)
Mutual labels:  speech-recognition
Rapiddraw
A simple artificial intelligence experiment to find out if mobile neural networks can recognize human-made doodles
Stars: ✭ 39 (-31.58%)
Mutual labels:  image-recognition
Keras Sincnet
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-17.54%)
Mutual labels:  speech-recognition
Cordova Plugin Tensorflow
On-device image recognition via TensorFlow/Inception. For Cordova/PhoneGap.
Stars: ✭ 51 (-10.53%)
Mutual labels:  image-recognition

BigLittleNet-pytorch

This repository holds the codes and models for the papers.

Chun-Fu (Richard) Chen, Quanfu Fan, Neil Mallinar, Tom Sercu and Rogerio Feris Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition

If you use the codes and models from this repo, please cite our work. Thanks!

@inproceedings{
    chen2018biglittle,
    title={{Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition}},
    author={Chun-Fu (Richard) Chen and Quanfu Fan and Neil Mallinar and Tom Sercu and Rogerio Feris},
    booktitle={International Conference on Learning Representations},
    year={2019},
    url={https://openreview.net/forum?id=HJMHpjC9Ym},
}

Dependent library

  1. pytorch >= 1.0.0
  2. tensorboard_logger
  3. tqdm

Or install requirement via:

pip3 install -r requirement.txt

Usage

The training script is mostly borrow from the imagenet example of pytorch/examples with modifications.

Please refer the instructions there to prepare the ImageNet dataset.

Training

Training a bL-ResNeXt-101 (64×4d) (α = 2, β = 4) model with two GPUs (0, 1) and saving logfile the LOGDIR folder

python3 imagenet-train.py --data /path/to/folder -d 101 --basewidth 4 \
--cardinality 64 --backbone_net blresnext --alpha 2 --beta 4 \
--lr_scheduler cosine --logdir LOGDIR --gpu 0,1

Test

After download the models, put in the pretrained folder. Evaluating the bL-ResNeXt-101 (64×4d) (α = 2, β = 4) model with two GPUs.

python3 imagenet-train.py --data /path/to/folder -d 101 --basewidth 4 \
--cardinality 64 --backbone_net blresnext --alpha 2 --beta 4 --evaluate \
--gpu 0,1 --pretrained

Please feel free to raise issue if you encounter issue when using the pretrained models.

Results and Models

After the submission, we re-train our models on PyTorch with the same setting described in the paper.

Performance of Big-Little Net models (evaluating on a single 224x224 image.)

Model Top-1 Error FLOPs (10^9)
bLResNet-50 (α = 2, β = 4) 22.41% 2.85
bLResNet-101 (α = 2, β = 4) 21.34% 3.89
bLResNeXt-50 (32x4d) (α = 2, β = 4) 21.62% 3.03
bLResNeXt-101 (32x4d) (α = 2, β = 4) 20.87% 4.08
bLResNeXt-101 (64x4d) (α = 2, β = 4) 20.34% 7.97
bLSEResNeXt-50 (32x4d) (α = 2, β = 4) 21.44% 3.03
bLSEResNeXt-101 (32x4d) (α = 2, β = 4) 21.04% 4.08
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].