All Projects → oyxhust → Ssd Text_detection

oyxhust / Ssd Text_detection

A modified SSD model for text detection

Labels

Projects that are alternatives of or similar to Ssd Text detection

Kodi Standalone Service
A systemd service to allow for standalone operation of kodi.
Stars: ✭ 83 (-8.79%)
Mutual labels:  makefile
Crab project
ROS hexapod robot on BeagleBone Black
Stars: ✭ 85 (-6.59%)
Mutual labels:  makefile
Animation Worklet
🚫 Old repository for AnimationWorklet specification ➡️ New repository: https://github.com/w3c/css-houdini-drafts
Stars: ✭ 89 (-2.2%)
Mutual labels:  makefile
Riscv Sbi Doc
Documentation for the RISC-V Supervisor Binary Interface
Stars: ✭ 84 (-7.69%)
Mutual labels:  makefile
Firmware
Armbian firmware
Stars: ✭ 85 (-6.59%)
Mutual labels:  makefile
Value Investing Newbie
Stars: ✭ 87 (-4.4%)
Mutual labels:  makefile
Wiki
Archive of free60.org mediawiki
Stars: ✭ 83 (-8.79%)
Mutual labels:  makefile
Flash2cocos2d X
use this tool you can export the flash data, and use the data in cocos2d-x game
Stars: ✭ 90 (-1.1%)
Mutual labels:  makefile
Sinetek Rtsx
macOS driver for Realtek SD card readers.
Stars: ✭ 85 (-6.59%)
Mutual labels:  makefile
Android device xiaomi gemini
Stars: ✭ 89 (-2.2%)
Mutual labels:  makefile
Ergodone
ErgoDox using pro micro. Original work by Dox. Brainhole association present
Stars: ✭ 84 (-7.69%)
Mutual labels:  makefile
Passport Strategy
An abstract class implementing Passport's strategy API.
Stars: ✭ 84 (-7.69%)
Mutual labels:  makefile
Wayland Protocols
Wayland protocol development
Stars: ✭ 87 (-4.4%)
Mutual labels:  makefile
Device Sony Yuga
Stars: ✭ 83 (-8.79%)
Mutual labels:  makefile
Apk File
Search apk package contents via the command line.
Stars: ✭ 89 (-2.2%)
Mutual labels:  makefile
Sqlite3 Android
SQLite CLI and Library build scripts for Android
Stars: ✭ 83 (-8.79%)
Mutual labels:  makefile
Make Handbook
Handbook about modern make usage
Stars: ✭ 85 (-6.59%)
Mutual labels:  makefile
Local Dev With Docker For Mac Kubernetes
Notes about local development with Docker for Mac and Kubernetes
Stars: ✭ 91 (+0%)
Mutual labels:  makefile
Drake
An R-focused pipeline toolkit for reproducibility and high-performance computing
Stars: ✭ 1,301 (+1329.67%)
Mutual labels:  makefile
Cardano Tutorials
ARCHIVED-This content in this repository is now located at https://docs.cardano.org/projects/cardano-node/
Stars: ✭ 89 (-2.2%)
Mutual labels:  makefile

SSD-text detection: Text Detector

This is a modified SSD model for text detection.

Compared to faster R-CNN, SSD is much faster. In my expriment, SSD only needs about 0.05s for each image.

Disclaimer

This is a re-implementation of mxnet SSD. The official repository is available here. The arXiv paper is available here.

Getting started

  • Build MXNet: Make sure the extra operators for this example is enabled, and please following the the official instructions here.

Train the model

I modify the original SSD on SynthText and ICDAR. Other datasets should be easily supported by adding subclass derived from class Imdb in dataset/imdb.py. See example of dataset/pascal_voc.py for details.

  • Download the converted pretrained vgg16_reduced model here, unzip .param and .json files into model/ directory by default.

To gain a good performance, we should train our model on SynthText which is a quite big dataset (about 40G) firstly, and then fine tune this model on ICDAR. If you want to apply this model for other applications, you can fine tune it on any dataset.

  • Download the SynthText dataset here, and extract it into data.

Because SSD requires every image's size but SythText is too big, it will take too much time if we have to use opencv to read the images' size each time when we star training. So I use 'read_size.py' (data/synthtext_img_size) to creat a h5py file 'size.h5' to store the sizes of all images. You can copy this file to the extracted folder 'SynthText'.

  • Start training:
python train_synthtext.py

Fine tune the model

  • Download the ICDAR challenge 2 dataset here, and extract it into data.

  • Start training:

python train_icdar.py --finetune N

Please replace 'N' into an integer number which depends on the save model you train on SynthText.

Try the demo

  • After training, you can try your model on test images. I give two demos here (demo.py and demo_savefig.py). demo.py can visualize the detection result, while demo_savefig.py can save the detection result as images.

When running demo_savefig.py, please give the test images path.

  • Run demo.py
# play with examples:
python demo.py --epoch 0 --images ./data/demo/test.jpg --thresh 0.5
  • Check python demo.py --help for more options.

When running demo_savefig.py, please give the test images folder path.

  • Run demo_savefig.py
# play with examples:
python demo_savefig.py --epoch 0 --images ./data/demo/test --thresh 0.5
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].