All Projects → SURFZJY → Real-time-Text-Detection-DBNet

SURFZJY / Real-time-Text-Detection-DBNet

Licence: Apache-2.0 license
PyTorch re-implementation of ''Real-time Scene Text Detection with Differentiable Binarization'' (AAAI 2020)

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Real-time-Text-Detection-DBNet

awesome-scene-text
A curated list of papers and resources for scene text detection and recognition
Stars: ✭ 43 (-49.41%)
Mutual labels:  text-detection, scene-text-detection
mongodb-backup-manager
🌿 A Full-stack MongoDB Backup System.
Stars: ✭ 42 (-50.59%)
Mutual labels:  real-time
danmaku-player
An HTML5 danmaku video player for real-time image processing using WebGl and Web Components.融合了webgl和web components的实时图像处理弹幕播放器
Stars: ✭ 40 (-52.94%)
Mutual labels:  real-time
ipfs-chat
Real-time P2P messenger using go-ipfs pubsub. TUI. End-to-end encrypted texting & file-sharing. NAT traversal.
Stars: ✭ 84 (-1.18%)
Mutual labels:  real-time
QPPWG
Quasi-Periodic Parallel WaveGAN Pytorch implementation
Stars: ✭ 41 (-51.76%)
Mutual labels:  real-time
nebula
A distributed block-based data storage and compute engine
Stars: ✭ 127 (+49.41%)
Mutual labels:  real-time
aquedux
Redux over the wire
Stars: ✭ 46 (-45.88%)
Mutual labels:  real-time
jeelizGlanceTracker
JavaScript/WebGL lib: detect if the user is looking at the screen or not from the webcam video feed. Lightweight and robust to all lighting conditions. Great for play/pause videos if the user is looking or not, or for person detection. Link to live demo.
Stars: ✭ 68 (-20%)
Mutual labels:  real-time
microbium-app
Draw new worlds
Stars: ✭ 89 (+4.71%)
Mutual labels:  real-time
AttaNet
AttaNet for real-time semantic segmentation.
Stars: ✭ 37 (-56.47%)
Mutual labels:  real-time
FastPose
pytorch realtime multi person keypoint estimation
Stars: ✭ 36 (-57.65%)
Mutual labels:  real-time
hyperseg
HyperSeg - Official PyTorch Implementation
Stars: ✭ 174 (+104.71%)
Mutual labels:  real-time
fos
Interesting project,the Fast Real Time Operating Systems( FOS-RTOS)
Stars: ✭ 22 (-74.12%)
Mutual labels:  real-time
tutorial
Tutorials to help you build your first Swim app
Stars: ✭ 27 (-68.24%)
Mutual labels:  real-time
gdax-orderbook-hpp
An in-memory copy of the order book on the GDAX cryptocurrency exchange, updated in real time via WebSocket feed, exposed in a thread-safe and lock-free data structure.
Stars: ✭ 38 (-55.29%)
Mutual labels:  real-time
ESANet
ESANet: Efficient RGB-D Semantic Segmentation for Indoor Scene Analysis
Stars: ✭ 154 (+81.18%)
Mutual labels:  real-time
SceneGraphFusion
No description or website provided.
Stars: ✭ 82 (-3.53%)
Mutual labels:  real-time
nvidia-jetson-rt
Real-Time Scheduling with NVIDIA Jetson TX2
Stars: ✭ 38 (-55.29%)
Mutual labels:  real-time
ECBSR
Edge-oriented Convolution Block for Real-time Super Resolution on Mobile Devices, ACM Multimedia 2021
Stars: ✭ 216 (+154.12%)
Mutual labels:  real-time
Real-time-Image-Smoothing-via-Iterative-Least-Squares
This is the released code for the paper Real-time Image Smoothing via Iterative Least Squares accepted to ACM Transactions on Graphics
Stars: ✭ 60 (-29.41%)
Mutual labels:  real-time

Real-time-Text-Detection

PyTorch re-implementation of Real-time Scene Text Detection with Differentiable Binarization

contour

Difference between thesis and this implementation

  1. Use dice loss instead of BCE(binary cross-entropy) loss.

  2. Use normal convolution rather than deformable convolution in the backbone network.

  3. The architecture of the backbone network is a simple FPN.

  4. Have not implement OHEM.

  5. The ground truth of the threshold map is constant 1 rather than 'the distance to the closest segment'.

Introduction

thanks to these project:

The features are summarized blow:

  • Use resnet18/resnet50/shufflenetV2 as backbone.

Contents

  1. Installation
  2. Download
  3. Train
  4. Predict
  5. Eval
  6. Demo

Installation

  1. pytorch 1.1.0

Download

  1. ShuffleNet_V2 Models trained on ICDAR 2013+2015 (training set)

https://pan.baidu.com/s/1Um0wzbTFjJC0jdJ703GR7Q

or https://mega.nz/#!WdhxXAxT!oGURvmbQFqTHu5hljUPdbDMzI75_UO2iWLaXX5dJrDw

Train

  1. modify genText.py to generate txt list file for training/testing data

  2. modify config.json

  3. run

python train.py

Predict

  1. run
python predict.py

Eval

run

python eval.py

Examples

contour

bbox

Todo

  • MobileNet backbone

  • Deformable convolution

  • tensorboard support

  • FPN --> Architecture in the thesis

  • Dice Loss --> BCE Loss

  • threshold map gt use 1 --> threshold map gt use distance (Use 1 will accelerate the label generation)

  • OHEM

  • OpenCV_DNN inference API for CPU machine

  • Caffe version (for deploying with MNN/NCNN)

  • ICDAR13 / ICDAR15 / CTW1500 / MLT2017 / Total-Text

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].