Natural Scene Text Detection Based on Deep Supervised Fully Convolutional Network - Details

Author：

Zhang, Nan (Zhang, Nan.) (Scholars：张楠) | Jin, Xiaoning (Jin, Xiaoning.) | Li, Xiaowei (Li, Xiaowei.)

Indexed by：

CPCI-S EI Scopus

Abstract：

In　the　past　few　years,　text　detection　in　natural　scenes　has　attracted　increasing　attention　due　to　many　real-world　applications.　Most　existing　methods　only　detect　horizontal　or　nearly　horizontal　texts　and　have　complicated　processes.　When　using　the　neural　network　to　detect　text　in　the　image,　some　ambiguity　and　small　words　are　easy　to　be　ignored　because　of　many　pooling　operations.　Therefore,　this　paper　proposes　an　end-to-end　trainable　neural　network　for　detecting　multi-oriented　text　lines　or　words　in　natural　scene　images.　The　network　fuses　multi-level　features　and　is　guided　by　deep　supervision　during　training.　In　this　way,　richer　hierarchical　representations　can　be　learned　automatically.　The　network　makes　two　kinds　of　predictions:　text/no　text　classification　and　location　regression,　thus　we　can　directly　locate　multi-oriented　words　or　text　lines　without　other　unnecessary　intermediate　steps.　Experimental　results　on　the　ICDAR　2015　datasets　and　MSRA-TD500　datasets　have　proven　that　the　proposed　method　outperforms　the　state-of-the-art　methods　by　a　noticeable　margin　on　F-score.

Keyword：

Multi-oriented text Deep supervision Scene image

Author Community：

[ 1 ] [Zhang, Nan]Beijing Univ Technol, Beijing Adv Innovat Ctr Future Internet Technol, Beijing, Peoples R China
[ 2 ] [Jin, Xiaoning]Beijing Univ Technol, Beijing Adv Innovat Ctr Future Internet Technol, Beijing, Peoples R China
[ 3 ] [Li, Xiaowei]Beijing Univ Technol, Beijing Adv Innovat Ctr Future Internet Technol, Beijing, Peoples R China

Reprint Author's Address：

[Jin, Xiaoning]Beijing Univ Technol, Beijing Adv Innovat Ctr Future Internet Technol, Beijing, Peoples R China

Email：

jinxn@bjut.edu.cn

Show more details

Related Keywords：

Fetal Ultrasound Image Segmentation for Automatic Head Circumference Biometry Using Deeply Supervised Attention-Gated V-Net
2021，JOURNAL OF DIGITAL IMAGING
Deeply supervised vestibule segmentation network for CT images with global context-aware pyramid feature extraction
2022，IET IMAGE PROCESSING
Indoor Scene Classification Based on Mid-Level Features
2017，International Conference on Information Technology and Intelligent Transportation Systems (ITITS)
An effective method for gender classification with convolutional neural networks
2015，15th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2015

Source ：

ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III

ISSN： 0302-9743

Year： 2018

Volume： 11166

Page： 439-448

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 1

Affiliated Colleges：

信息学部

城市建设学部

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to