
Authors:

Zhang, Xiqun | Duan, Lijuan (Scholar: 段立娟) | Ma, Longlong | Wu, Jian

Indexed in:

CPCI-S, EI, Scopus

Abstract:

In this paper, we present a text extraction method for historical Tibetan document images. Text extraction is treated as a text area detection and localization problem. First, the historical Tibetan document image is preprocessed to correct uneven illumination, tilt, and noise, and is then binarized. Second, the regions of interest in the document are divided into three categories using connected components (CCs). The image is divided evenly into grids, and the grids are filtered using the CC categories and the corner point density. The remaining grids are used to compute vertical and horizontal grid projections. Third, the approximate location of the text area is detected by analyzing the projections. Finally, the text area is extracted accurately by correcting the bounding box of the approximate text area. Experiments on a dataset of historical Tibetan document images demonstrate the effectiveness of the proposed method.
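
The grid filtering and projection steps described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the filtering rule, so the sketch assumes a simple ink-density range per grid cell as a stand-in for the filter based on connected-component categories and corner point density. All function names, the cell size, and the thresholds are hypothetical.

```python
def grid_mask(binary, cell, min_density=0.05, max_density=0.6):
    """Mark grid cells whose ink density lies in [min_density, max_density].

    Assumption: density filtering stands in for the paper's filter based on
    connected-component categories and corner point density.
    """
    rows, cols = len(binary) // cell, len(binary[0]) // cell
    mask = [[False] * cols for _ in range(rows)]
    for gr in range(rows):
        for gc in range(cols):
            ink = sum(
                binary[y][x]
                for y in range(gr * cell, (gr + 1) * cell)
                for x in range(gc * cell, (gc + 1) * cell)
            )
            density = ink / (cell * cell)
            mask[gr][gc] = min_density <= density <= max_density
    return mask


def grid_projections(mask):
    """Count the retained cells in each grid row and each grid column."""
    horizontal = [sum(row) for row in mask]
    vertical = [sum(col) for col in zip(*mask)]
    return horizontal, vertical


def span(profile, min_count):
    """Return the first and last index whose count reaches min_count."""
    hits = [i for i, v in enumerate(profile) if v >= min_count]
    return (hits[0], hits[-1]) if hits else None


def approximate_text_box(binary, cell, min_count=1):
    """Approximate text area as (left, top, right, bottom) in pixels.

    right and bottom are exclusive; returns None if no cell survives.
    """
    horizontal, vertical = grid_projections(grid_mask(binary, cell))
    rows, cols = span(horizontal, min_count), span(vertical, min_count)
    if rows is None or cols is None:
        return None
    return (cols[0] * cell, rows[0] * cell,
            (cols[1] + 1) * cell, (rows[1] + 1) * cell)
```

Here `binary` is a list of rows with 1 for ink and 0 for background. The returned box is only aligned to the grid, which is why a later correction of the bounding box, as the abstract describes, would still be needed for an accurate result.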

Keywords:

Connected components; Corner point; Historical Tibetan document; Text extraction

Author affiliations:

  • [ 1 ] [Zhang, Xiqun]Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
  • [ 2 ] [Duan, Lijuan]Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
  • [ 3 ] [Zhang, Xiqun]Beijing Key Lab Trusted Comp, Beijing, Peoples R China
  • [ 4 ] [Duan, Lijuan]Beijing Key Lab Integrat & Anal Large Scale Strea, Beijing, Peoples R China
  • [ 5 ] [Ma, Longlong]Chinese Acad Sci, Inst Software, Chinese Informat Proc Lab, Beijing, Peoples R China
  • [ 6 ] [Wu, Jian]Chinese Acad Sci, Inst Software, Chinese Informat Proc Lab, Beijing, Peoples R China

Corresponding author:

  • [Ma, Longlong]Chinese Acad Sci, Inst Software, Chinese Informat Proc Lab, Beijing, Peoples R China


Source:

COMPUTER VISION, PT II

ISSN: 1865-0929

Year: 2017

Volume: 772

Pages: 545-555

Language: English

Citation counts:

Web of Science Core Collection citations: 6

Scopus citations: 8

ESI Highly Cited Papers listings: 0

