All Projects → Fengdalu → Lipreading Densenet3d

Fengdalu / Lipreading Densenet3d

DenseNet3D Model In "LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild", https://arxiv.org/abs/1810.06990

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Lipreading Densenet3d

Deepfacelab
DeepFaceLab is the leading software for creating deepfakes.
Stars: ✭ 30,308 (+33205.49%)
Mutual labels:  deeplearning, arxiv
Cnn Paper2
🎨 🎨 深度学习 卷积神经网络教程 :图像识别,目标检测,语义分割,实例分割,人脸识别,神经风格转换,GAN等🎨🎨 https://dataxujing.github.io/CNN-paper2/
Stars: ✭ 77 (-15.38%)
Mutual labels:  deeplearning, arxiv
Lipnet Pytorch
The state-of-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxiv.org/abs/1611.01599)
Stars: ✭ 104 (+14.29%)
Mutual labels:  deeplearning, arxiv
Arxiv Vanity
Renders papers from arXiv as responsive web pages so you don't have to squint at a PDF.
Stars: ✭ 1,190 (+1207.69%)
Mutual labels:  arxiv
Auto car
基于深度学习的自动避障智能小车
Stars: ✭ 74 (-18.68%)
Mutual labels:  deeplearning
Deeplearning
Neural network base on c++14, support any number of layers 基于C++14元编程的深度学习神经网络模板类,支持任意层数
Stars: ✭ 80 (-12.09%)
Mutual labels:  deeplearning
Wb color augmenter
WB color augmenter improves the accuracy of image classification and image semantic segmentation methods by emulating different WB effects (ICCV 2019) [Python & Matlab].
Stars: ✭ 89 (-2.2%)
Mutual labels:  deeplearning
Udacity Natural Language Processing Nanodegree
Tutorials and my solutions to the Udacity NLP Nanodegree
Stars: ✭ 73 (-19.78%)
Mutual labels:  deeplearning
Awesome Vln
A curated list of research papers in Vision-Language Navigation (VLN)
Stars: ✭ 86 (-5.49%)
Mutual labels:  arxiv
Skip Thought Tf
An implementation of skip-thought vectors in Tensorflow
Stars: ✭ 77 (-15.38%)
Mutual labels:  deeplearning
Tf Mobilenet V2
Mobilenet V2(Inverted Residual) Implementation & Trained Weights Using Tensorflow
Stars: ✭ 85 (-6.59%)
Mutual labels:  deeplearning
Datamining
数据挖掘开源书
Stars: ✭ 76 (-16.48%)
Mutual labels:  deeplearning
Mit Deep Learning
Tutorials, assignments, and competitions for MIT Deep Learning related courses.
Stars: ✭ 8,912 (+9693.41%)
Mutual labels:  deeplearning
Flops Counter.pytorch
Flops counter for convolutional networks in pytorch framework
Stars: ✭ 1,223 (+1243.96%)
Mutual labels:  deeplearning
Deep learning for biologists with keras
tutorials made for biologists to learn deep learning
Stars: ✭ 74 (-18.68%)
Mutual labels:  deeplearning
Novel Deep Learning Model For Traffic Sign Detection Using Capsule Networks
capsule networks that achieves outstanding performance on the German traffic sign dataset
Stars: ✭ 88 (-3.3%)
Mutual labels:  deeplearning
Casual Digressions
💤 Old repository of notes on machine learning papers.
Stars: ✭ 73 (-19.78%)
Mutual labels:  arxiv
Tools To Design Or Visualize Architecture Of Neural Network
Tools to Design or Visualize Architecture of Neural Network
Stars: ✭ 1,143 (+1156.04%)
Mutual labels:  deeplearning
Kamonohashi
AI開発プラットフォームKAMONOHASHI
Stars: ✭ 80 (-12.09%)
Mutual labels:  deeplearning
Pytorch Seq2seq
An open source framework for seq2seq models in PyTorch.
Stars: ✭ 1,297 (+1325.27%)
Mutual labels:  deeplearning

Lipreading-DenseNet3D

DenseNet3D Model In "DenseNet3D Model In "LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild", https://arxiv.org/abs/1810.06990

Sample of the proposed LRW-1000

Update

2020-12-10: Please check https://github.com/Fengdalu/learn-an-effective-lip-reading-model-without-pains for our base models, which provides more detailed configuration on LRW and LRW-1000.

Introduction

This respository is implementation of the proposed DenseNet-3D network in LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild. Our paper can be found here.

Data Preparation

Download LRW1000 Dataset and place LRW1000_Public in the root of this repository. Instead, you can create symbolic links to this project:

ln -s LRW1000_Public Lipreading-DenseNet3D/LRW1000_Public

Training And Testing

You can train or test the model by running:

python main.py options_lip.toml

Model architecture details and data annotation items are configured in options_lip.toml. Please pay attention that you may need modify the code in options_lip.toml and change the parameters to make the scripts work just as expected.

Dependencies

  • PyTorch 1.0+
  • toml
  • tensorboardX
  • imageio

Reference

If this repository was useful for your research, please cite our work:

@inproceedings{yang2019lrw,
  title={LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild},
  author={Yang, Shuang and Zhang, Yuanhang and Feng, Dalu and Yang, Mingmin and Wang, Chenhao and Xiao, Jingyun and Long, Keyu and Shan, Shiguang and Chen, Xilin},
  booktitle={2019 14th IEEE International Conference on Automatic Face \& Gesture Recognition (FG 2019)},
  pages={1--8},
  year={2019},
  organization={IEEE},
  url={https://github.com/Fengdalu/Lipreading-DenseNet3D}
}

Related Projects

Another implmentation Of DenseNet-3D

Learn an Effective Lip Reading Model without Pains (Strong Recommended)

LipNet-PyTorch (The state-of-the-art PyTorch Version)

End-to-end-lipreading

Lipreading using Temporal Convolutional Networks

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].