
Authors:

Zhang, Chunjie | Liang, Chao | Pang, Junbiao (scholar profile: 庞俊彪) | Zhang, Yifan | Liu, Jing | Qin, Lei | Huang, Qingming

Indexed in:

EI Scopus SCIE

Abstract:

The bag of visual words (BoW) model and its variants have demonstrated their effectiveness for visual applications. The BoW model first extracts local features and generates the corresponding codebook, where the elements of the codebook are viewed as visual words. However, the codebook is dataset dependent and has to be generated for each image dataset. Besides, when only a limited number of training images is available, the resulting codebook may not be able to encode images well. Regenerating the codebook requires a lot of computational time, and a poorly fitted codebook weakens the generalization power of the BoW model. To solve these problems, in this paper, we propose to undo the dataset bias by linear codebook transformation in an unsupervised manner. To represent each point in the local feature space, we need a number of linearly independent basis vectors. We view the codebook as a linear transformation of these basis vectors. In this way, we can transform the pre-learned codebooks for a new dataset using the pseudo-inverse of the transformation matrix. However, this is an under-determined problem which may lead to many solutions. Besides, not all of the visual words are equally important for the new dataset. It would be more effective to select the discriminative visual words for transformation. Specifically, sparsity constraints and the F-norm of the transformation matrix are used in this paper. We propose an alternative optimization algorithm to jointly search for the optimal linear transformation matrices and the encoding parameters. The proposed method needs no labeled images from either the source dataset or the target dataset. Image classification experimental results on several image datasets show the effectiveness of the proposed method. (C) 2014 Elsevier B.V. All rights reserved.
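
The idea of adapting a pre-learned codebook with a linear transformation, while alternating between the transformation and the encoding parameters, can be illustrated with a small NumPy sketch. This is not the authors' formulation, which the abstract does not give in detail. It assumes a simplified objective, 0.5*||X - T B_src C||_F^2 + lam*||C||_1 + 0.5*mu*||T||_F^2, where the sparsity penalty is placed on the codes C and the F-norm penalty on the transformation T. The names `adapt_codebook`, `sparse_encode`, `B_src`, `lam`, and `mu` are hypothetical, and the pseudo-inverse step mentioned in the abstract is replaced here by a regularized least-squares update.

```python
import numpy as np


def soft_threshold(z, t):
    """Elementwise soft-thresholding, the proximal operator of t * ||.||_1."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)


def sparse_encode(X, D, lam, C0, n_iter=50):
    """ISTA steps for min_C 0.5*||X - D C||_F^2 + lam*||C||_1, warm-started at C0."""
    step_inv = np.linalg.norm(D, 2) ** 2 + 1e-12  # Lipschitz constant of the gradient
    C = C0.copy()
    for _ in range(n_iter):
        grad = D.T @ (D @ C - X)
        C = soft_threshold(C - grad / step_inv, lam / step_inv)
    return C


def objective(X, B_src, T, C, lam, mu):
    """Assumed objective: reconstruction + L1 on codes + squared F-norm on T."""
    residual = X - T @ B_src @ C
    return 0.5 * np.sum(residual**2) + lam * np.abs(C).sum() + 0.5 * mu * np.sum(T**2)


def adapt_codebook(X, B_src, lam=0.1, mu=0.1, n_outer=10):
    """Alternate between sparse codes C and a linear transformation T.

    X:     (d, n) unlabeled local features from the target dataset
    B_src: (d, k) codebook learned beforehand on a source dataset
    Returns T (d, d), C (k, n), and the objective value after each outer step.
    """
    d, k = B_src.shape
    T = np.eye(d)                      # start from the untransformed codebook
    C = np.zeros((k, X.shape[1]))
    history = []
    for _ in range(n_outer):
        # Step 1: fix T, update the codes against the transformed codebook.
        C = sparse_encode(X, T @ B_src, lam, C)
        # Step 2: fix C, solve the ridge problem for T in closed form:
        #   T (M M^T + mu I) = X M^T, with M = B_src C.
        M = B_src @ C
        A = M @ M.T + mu * np.eye(d)   # symmetric positive definite
        T = np.linalg.solve(A, M @ X.T).T
        history.append(objective(X, B_src, T, C, lam, mu))
    return T, C, history


rng = np.random.default_rng(0)
B_src = rng.standard_normal((8, 16))   # toy source codebook
X = rng.standard_normal((8, 40))       # toy unlabeled target features
T, C, history = adapt_codebook(X, B_src)
adapted_codebook = T @ B_src           # codebook adapted to the target data
```

No labels are used anywhere in the sketch, which mirrors the unsupervised setting described in the abstract. Because the code update is warm-started and the update of T is an exact minimizer, each outer step should not increase the assumed objective.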

Keywords:

Codebook bias; Sparsity; Alternative optimization; Linear transformation

Author affiliations:

  • [ 1 ] [Zhang, Chunjie]Univ Chinese Acad Sci, Sch Comp & Control Engn, Beijing 100049, Peoples R China
  • [ 2 ] [Huang, Qingming]Univ Chinese Acad Sci, Sch Comp & Control Engn, Beijing 100049, Peoples R China
  • [ 3 ] [Liang, Chao]Wuhan Univ, Sch Comp, Natl Engn Res Ctr Multimedia Software, Wuhan 430072, Peoples R China
  • [ 4 ] [Pang, Junbiao]Beijing Univ Technol, Coll Metropolitan Transportat, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
  • [ 5 ] [Zhang, Yifan]Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China
  • [ 6 ] [Liu, Jing]Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China
  • [ 7 ] [Qin, Lei]Chinese Acad Sci, Inst Comp Technol, Key Lab Intell Info Proc, Beijing 100190, Peoples R China
  • [ 8 ] [Huang, Qingming]Chinese Acad Sci, Inst Comp Technol, Key Lab Intell Info Proc, Beijing 100190, Peoples R China

Corresponding author:

  • [Liang, Chao]Wuhan Univ, Sch Comp, Natl Engn Res Ctr Multimedia Software, Wuhan 430072, Peoples R China

Source:

PATTERN RECOGNITION LETTERS

ISSN: 0167-8655

Year: 2014

Volume: 45

Pages: 197-204

Impact factor: 5.100 (JCR@2022)

ESI discipline: ENGINEERING

ESI highly cited threshold: 176

JCR quartile: 2

CAS (Chinese Academy of Sciences) journal tier: 3

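Citation counts:

WoS Core Collection citations: 1

Scopus citations: 1

ESI Highly Cited Papers listings: 0

Wanfang citations:

Chinese-language citations:

Views in the last 30 days: 1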
