Authors:

Zhang, Chunjie | Zhang, Yifan | Wang, Shuhui | Pang, Junbiao (scholar: 庞俊彪) | Liang, Chao | Huang, Qingming | Tian, Qi

Indexed in:

EI, Scopus

Abstract:

The bag-of-visual-words (BoW) model and its variants have demonstrated their effectiveness in visual applications and are widely used by researchers. The BoW model first extracts local features and generates a corresponding codebook, whose elements are viewed as visual words. The local features within each image are then encoded to obtain the final histogram representation. However, the codebook is dataset dependent and has to be generated for each image dataset. This costs a lot of computational time and weakens the generalization power of the BoW model. To solve these problems, we propose to undo the dataset bias by codebook linear transformation. To represent every point within the local feature space using Euclidean distance, the number of bases should be no less than the dimension of the space. Hence, each codebook can be viewed as a linear transformation of these bases. In this way, we can transform pre-learned codebooks for a new dataset. However, not all visual words are equally important for the new dataset; it is more effective to select, using sparsity constraints, the most discriminative visual words for transformation. We propose an alternating optimization algorithm to jointly search for the optimal linear transformation matrices and the encoding parameters. Image classification experiments on several image datasets show the effectiveness of the proposed method. Copyright © 2013 ACM.
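
The abstract does not give the objective function or the update rules, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes a hypothetical objective 0.5 * ||X - B W C||^2 + lam * ||C||_1 + mu * ||W||_1, where B is a pre-learned codebook, W is the linear transformation to be learned, and C holds the encoding parameters for the new dataset's local features X. The function names, the L1 penalties, and the proximal-gradient updates are all assumptions chosen for brevity.

```python
import numpy as np


def soft_threshold(A, t):
    """Proximal operator of t * ||A||_1 (element-wise shrinkage)."""
    return np.sign(A) * np.maximum(np.abs(A) - t, 0.0)


def objective(X, B, W, C, lam, mu):
    """Assumed objective: reconstruction error plus L1 penalties on C and W."""
    R = X - B @ W @ C
    return 0.5 * np.sum(R ** 2) + lam * np.abs(C).sum() + mu * np.abs(W).sum()


def fit_transformed_codebook(X, B, lam=0.1, mu=0.1, n_iter=30):
    """Alternate one proximal-gradient step on C and one on W.

    X: (d, n) local features from the new dataset.
    B: (d, k) pre-learned codebook, one visual word per column.
    Returns the transformation W (k, k), the codes C (k, n),
    and the objective value recorded after each iteration.
    """
    k = B.shape[1]
    W = np.eye(k)                      # start from the untransformed codebook
    C = np.zeros((k, X.shape[1]))
    history = []
    for _ in range(n_iter):
        # C-step: transformed codebook D = B W is held fixed.
        D = B @ W
        # Step size 1/L, with L at least the Lipschitz constant ||D||_2^2.
        L = max(np.linalg.norm(D, 2) ** 2, 1.0)
        C = soft_threshold(C - (D.T @ (D @ C - X)) / L, lam / L)

        # W-step: codes C are held fixed.
        # Gradient of the reconstruction term w.r.t. W is B^T (B W C - X) C^T.
        L = max((np.linalg.norm(B, 2) * np.linalg.norm(C, 2)) ** 2, 1.0)
        G = B.T @ (B @ W @ C - X) @ C.T
        W = soft_threshold(W - G / L, mu / L)

        history.append(objective(X, B, W, C, lam, mu))
    return W, C, history
```

Under these assumptions, each step is a proximal-gradient update with a step size no larger than the inverse Lipschitz constant of its block, so the recorded objective does not increase between iterations. The L1 penalty on W drives entries of the transformation to zero, which is one simple way to mimic selecting a subset of visual words.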

Keywords:

Linear transformations; Mathematical transformations; Classification (of information)

Author affiliations:

  • [ 1 ] [Zhang, Chunjie] School of Computer and Control Engineering, University of Chinese Academy of Sciences, 100049, Beijing, China
  • [ 2 ] [Zhang, Yifan] National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China
  • [ 3 ] [Wang, Shuhui] Key Lab of Intell. Info. Process, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China
  • [ 4 ] [Pang, Junbiao] College of Computer Science and Technology, Beijing University of Technology, 100124 Beijing, China
  • [ 5 ] [Liang, Chao] School of Computer, National Engineering Research Center for Multimedia Software, Wuhan University, 430072, Wuhan, China
  • [ 6 ] [Huang, Qingming] School of Computer and Control Engineering, University of Chinese Academy of Sciences, 100049, Beijing, China
  • [ 7 ] [Huang, Qingming] Key Lab of Intell. Info. Process, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China
  • [ 8 ] [Tian, Qi] Department of Computer Sciences, University of Texas at San Antonio, San Antonio, TX 78249, United States

Year: 2013

Pages: 533-536

Language: English

Citation counts:

Scopus citations: 4

ESI Highly Cited Papers listings: 0

Views in the last 30 days: 0
