All Projects → MhLiao → TextBoxes

MhLiao / TextBoxes

Licence: other
TextBoxes: A Fast Text Detector with a Single Deep Neural Network

Projects that are alternatives of or similar to TextBoxes

rgbd person tracking
R-GBD Person Tracking is a ROS framework for detecting and tracking people from a mobile robot.
Stars: ✭ 46 (-92.64%)
Mutual labels:  detection, scene
tools-generation-detection-synthetic-content
Compilation of the state of the art of tools, articles, forums and links of interest to generate and detect any type of synthetic content using deep learning.
Stars: ✭ 107 (-82.88%)
Mutual labels:  text, detection
East icpr
Forked from argman/EAST for the ICPR MTWI 2018 CHALLENGE
Stars: ✭ 154 (-75.36%)
Mutual labels:  text, detection
Mpitextkit
Powerful text framework for iOS to display rich text based on TextKit, inspired by Texture and YYText.
Stars: ✭ 191 (-69.44%)
Mutual labels:  text
Sharect
🔗 A lightweight JavaScript library to let users share their text selections to social networks.
Stars: ✭ 192 (-69.28%)
Mutual labels:  text
Finalcut
A text-based widget toolkit
Stars: ✭ 244 (-60.96%)
Mutual labels:  text
SilentServer
Silent is very lightweight, high quality - low latency voice chat for gaming. The server runs on Windows and Linux.
Stars: ✭ 52 (-91.68%)
Mutual labels:  text
Cucco
Text normalization library for Python
Stars: ✭ 185 (-70.4%)
Mutual labels:  text
DotGrok
Parse text with pattern. Inspired by grok filter.
Stars: ✭ 26 (-95.84%)
Mutual labels:  text
React Native Text Size
Measure text accurately before laying it out and get font information from your App.
Stars: ✭ 238 (-61.92%)
Mutual labels:  text
Swiftrichstring
👩‍🎨 Elegant Attributed String composition in Swift sauce
Stars: ✭ 2,744 (+339.04%)
Mutual labels:  text
Alyn
Detect and fix skew in images containing text
Stars: ✭ 202 (-67.68%)
Mutual labels:  text
Text
An efficient packed, immutable Unicode text type for Haskell, with a powerful loop fusion optimization framework.
Stars: ✭ 248 (-60.32%)
Mutual labels:  text
Rskplaceholdertextview
A light-weight UITextView subclass that adds support for placeholder.
Stars: ✭ 192 (-69.28%)
Mutual labels:  text
J2N
Java-like Components for .NET
Stars: ✭ 37 (-94.08%)
Mutual labels:  text
Rtltmpro
Right-To-Left Text Mesh Pro for Unity. This plugin adds support for Persian and Arabic languages to TextMeshPro.
Stars: ✭ 187 (-70.08%)
Mutual labels:  text
GPT2-Telegram-Chatbot
GPT-2 Telegram Chat bot
Stars: ✭ 67 (-89.28%)
Mutual labels:  text
Deepsegment
A sentence segmenter that actually works!
Stars: ✭ 211 (-66.24%)
Mutual labels:  text
React Native Text Ticker
React Native Text Ticker/Marquee Component
Stars: ✭ 212 (-66.08%)
Mutual labels:  text
Stubble
Trimmed down {{mustache}} templates in .NET
Stars: ✭ 247 (-60.48%)
Mutual labels:  text

TextBoxes: A Fast Text Detector with a Single Deep Neural Network

Recommend: TextBoxes++ is an extended work of TextBoxes, which supports oriented scene text detection. The recognition part is also included in TextBoxes++.

Introduction

This paper presents an end-to-end trainable fast scene text detector, named TextBoxes, which detects scene text with both high accuracy and efficiency in a single network forward pass, involving no post-process except for a standard nonmaximum suppression. For more details, please refer to our paper.

Citing TextBoxes

Please cite TextBoxes in your publications if it helps your research:

@inproceedings{LiaoSBWL17,
  author    = {Minghui Liao and
               Baoguang Shi and
               Xiang Bai and
               Xinggang Wang and
               Wenyu Liu},
  title     = {TextBoxes: {A} Fast Text Detector with a Single Deep Neural Network},
  booktitle = {AAAI},
  year      = {2017}
}

Contents

  1. Installation
  2. Download
  3. Test
  4. Train
  5. Performance

Installation

  1. Get the code. We will call the directory that you cloned Caffe into $CAFFE_ROOT
git clone https://github.com/MhLiao/TextBoxes.git

cd TextBoxes

make -j8

make py

Download

  1. Models trained on ICDAR 2013: Dropbox link BaiduYun link
  2. Fully convolutional reduced (atrous) VGGNet: Dropbox link BaiduYun link
  3. Compiled mex file for evaluation(for multi-scale test evaluation: evaluation_nms.m): Dropbox link BaiduYun link

Test

  1. run "python examples/demo.py".
  2. You can modify the "use_multi_scale" in the "examples/demo.py" script to control whether to use multi-scale or not.
  3. The results are saved in the "examples/results/".

Train

  1. Train about 50k iterions on Synthetic data which refered in the paper.
  2. Train about 2k iterions on corresponding training data such as ICDAR 2013 and SVT.
  3. For more information, such as learning rate setting, please refer to the paper.

Performance

  1. Using the given test code, you can achieve an F-measure of about 80% on ICDAR 2013 with a single scale.
  2. Using the given multi-scale test code, you can achieve an F-measure of about 85% on ICDAR 2013 with a non-maximum suppression.
  3. More performance information, please refer to the paper and Task1 and Task4 of Challenge2 on the ICDAR 2015 website: http://rrc.cvc.uab.es/?ch=2&com=evaluation

Data preparation for training

The reference xml file is as following:

    <?xml version="1.0" encoding="utf-8"?>
    <annotation>
        <object>
            <name>text</name>
            <bndbox>
                <xmin>158</xmin>
                <ymin>128</ymin>
                <xmax>411</xmax>
                <ymax>181</ymax>
            </bndbox>
        </object>
        <object>
            <name>text</name>
            <bndbox>
                <xmin>443</xmin>
                <ymin>128</ymin>
                <xmax>501</xmax>
                <ymax>169</ymax>
            </bndbox>
        </object>
        <folder></folder>
        <filename>100.jpg</filename>
        <size>
            <width>640</width>
            <height>480</height>
            <depth>3</depth>
        </size>
    </annotation>

Please let me know if you encounter any issues.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].