A touching character database from Tibetan historical documents to evaluate the segmentation algorithm - Details

Author：

Zhao, Quanchao (Zhao, Quanchao.) | Ma, Long-long (Ma, Long-long.) | Duan, Lijuan (Duan, Lijuan.) (Scholars：段立娟)

Indexed by：

Abstract：

The　benchmarking　database　plays　an　essential　role　in　evaluating　the　performance　of　the　touching　character　string　segmentation　algorithm.　In　this　paper,　we　present　a　new　touching　Tibetan　character　strings　database.　Firstly,　using　the　previous　proposed　layout　analysis　and　text-line　segmentation　algorithms,　we　segment　scanned　images　of　historical　Tibetan　documents　into　text-line　images.　Then,　we　find　candidate　touching　Tibetan　character　strings　using　connected　component　analysis　and　screen　out　the　correct　touching　samples.　Finally,　we　annotate　the　data　manually　and　establish　the　touching　character　database.　The　database　contains　5,844　images　of　two-touching　characters　and　1,399　images　of　more　than　two-touching　characters.　It　is　applicable　to　evaluate　the　segmentation　algorithms　for　the　touching　Tibetan　character　strings.　For　each　image,　the　annotated　ground　truth　file　includes　class　labels,　candidate　segment　points,　baseline　and　average　stroke　width　of　a　Tibetan　single　character.　According　to　the　type　of　touching,　we　divide　the　touching　character　string　into　three　types:　AB,　OB　and　BB.　We　also　count　the　number　of　different　type　of　samples　and　find　that　76.27%　of　the　samples　belongs　to　the　third　type　(BB).　In　the　end,　we　measure　the　performance　of　the　over-segmentation　algorithm　on　this　database　for　reference.　©　Springer　Nature　Switzerland　AG　2018.

Keyword：

Computer vision Database systems Image segmentation Touch screens Benchmarking

Author Community：

[ 1 ] [Zhao, Quanchao]Faculty of Information Technology, Beijing University of Technology, Beijing, China
[ 2 ] [Zhao, Quanchao]Beijing Key Laboratory of Trusted Computing, Beijing, China
[ 3 ] [Ma, Long-long]Chinese Information Processing Laboratory, Institute of Software, Chinese Academy of Sciences, Beijing, China
[ 4 ] [Duan, Lijuan]Faculty of Information Technology, Beijing University of Technology, Beijing, China
[ 5 ] [Duan, Lijuan]Beijing Key Laboratory on Integration and Analysis of Large-Scale Stream Data, Beijing, China

Reprint Author's Address：

[zhao, quanchao]beijing key laboratory of trusted computing, beijing, china;;[zhao, quanchao]faculty of information technology, beijing university of technology, beijing, china

Email：

quanchaozhao@yeah.net

Show more details

Related Keywords：

A off-line system on detection of overprint deviation in color printing
2008，Journal of Beijing University of Technology
Research on Direction Recognition of Plug Seedling of Vegetable Based on Image Morphology
2018，2nd IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference, IMCEC 2018
Study on machine vision based on bottle inspecting system
2007，Journal of Beijing University of Technology
Automated estimation of ore size distributions based on machine vision
2014，2012 International Conference on Electrical and Electronics Engineering, ICEE 2012
A multi-stage segmentation based on inner-class relation with discriminative learning
2014，9th International Conference on Computer Vision Theory and Applications, VISAPP 2014

Source ：

ISSN： 0302-9743

Year： 2018

Volume： 11259 LNCS

Page： 309-321

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 7

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 0

Affiliated Colleges：

信息学部

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to