A Text-Line Segmentation Method for Historical Tibetan Documents Based on Baseline Detection - Details

Author：

Li, Yanxing (Li, Yanxing.) | Ma, Longlong (Ma, Longlong.) | Duan, Lijuan (Duan, Lijuan.) (Scholars：段立娟) | Wu, Jian (Wu, Jian.)

Indexed by：

CPCI-S EI Scopus

Abstract：

Text-line　segmentation　is　an　important　task　in　the　historical　Tibetan　document　recognition.　Historical　Tibetan　document　images　usually　contain　touching　or　overlapping　characters　between　consecutive　text-lines,　making　text-line　segmentation　a　difficult　task.　In　this　paper,　we　present　a　text-line　segmentation　method　based　on　baseline　detection.　The　initial　positions　for　the　baseline　of　each　line　are　obtained　by　template　matching,　pruning　algorithms　and　closing　operation.　The　baseline　is　estimated　using　dynamic　tracing　within　pixel　points　of　each　line　and　the　context　information　between　pixel　points.　The　overlapping　or　touching　areas　are　cut　by　finding　the　minimum　width　stroke.　Finally,　text-lines　are　extracted　based　on　the　estimated　baseline　and　the　cut　position　of　touching　area.　The　proposed　algorithm　has　been　evaluated　on　the　dataset　of　historical　Tibetan　document　images.　Experimental　result　shows　the　effectiveness　of　the　proposed　method.

Keyword：

Historical Tibetan document Text-line segmentation Baseline detection

Author Community：

[ 1 ] [Li, Yanxing]Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
[ 2 ] [Duan, Lijuan]Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
[ 3 ] [Wu, Jian]Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
[ 4 ] [Li, Yanxing]Beijing Key Lab Trusted Comp, Beijing, Peoples R China
[ 5 ] [Ma, Longlong]Chinese Acad Sci, Inst Software, Chinese Informat Proc Lab, Beijing, Peoples R China
[ 6 ] [Wu, Jian]Chinese Acad Sci, Inst Software, Chinese Informat Proc Lab, Beijing, Peoples R China
[ 7 ] [Duan, Lijuan]Beijing Key Lab Integrat & Anal Large Scale Strea, Beijing, Peoples R China

Reprint Author's Address：

段立娟
[Duan, Lijuan]Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China;;[Duan, Lijuan]Beijing Key Lab Integrat & Anal Large Scale Strea, Beijing, Peoples R China

Email：

liyanxing15@outlook.com |
longlong@iscas.ac.cn |
ljduan@bjut.edu.cn |
wujian@iscas.ac.cn

Show more details

Related Keywords：

Segmentation and Recognition for Historical Tibetan Document Images
2020，IEEE ACCESS
Text Extraction for Historical Tibetan Document Images Based on Connected Component Analysis and Corner Point Detection
2017，2nd CCF Chinese Conference on Computer Vision (CCCV)
A Touching Character Database from Tibetan Historical Documents to Evaluate the Segmentation Algorithm
2018，PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT IV

Source ：

COMPUTER VISION, PT I

ISSN： 1865-0929

Year： 2017

Volume： 771

Page： 356-367

Language： English

Cited Count：

WoS CC Cited Count： 7

SCOPUS Cited Count： 11

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 3

Affiliated Colleges：

信息学部

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to