Authors:

En, Qing | Duan, Lijuan (scholar: 段立娟) | Zhang, Zhaoxiang | Bai, Xiang | Zhang, Yundong

Indexed in:

EI

Abstract:

We explore a principled method to address the weakly supervised detection problem. Many deep learning methods solve weakly supervised detection by mining various object proposals or pooling strategies, which may cause redundancy and generate a coarse location. To overcome this limitation, we propose a novel human-like active searching strategy that recurrently ignores the background and discovers class-specific objects by erasing undesired pixels from the image. The proposed detector acts as an agent, providing guidance to erase unremarkable regions and eventually concentrating the attention on the foreground. The proposed agents, which are composed of a deep Q-network and are trained by the Q-learning algorithm, analyze the contents of the image features to infer the localization action according to the learned policy. To the best of our knowledge, this is the first attempt to apply reinforcement learning to address weakly supervised localization with only image-level labels. The proposed method is validated on the PASCAL VOC 2007 and PASCAL VOC 2012 datasets. The experimental results show that the proposed method is capable of locating a single object within 5 steps and has great significance to the research on weakly supervised localization with a human-like mechanism. © 2019, Association for the Advancement of Artificial Intelligence (www.aaai.org).

Keywords:

Reinforcement learning; Learning algorithms; Deep learning; Object detection; Learning systems

Author affiliations:

  • [ 1 ] [En, Qing] Beijing Key Laboratory of Trusted Computing, Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
  • [ 2 ] [Duan, Lijuan] Beijing Key Laboratory of Trusted Computing, Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
  • [ 3 ] [Zhang, Zhaoxiang] Center for Research on Intelligent Perception and Computing, National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
  • [ 4 ] [Bai, Xiang] School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan 430074, China
  • [ 5 ] [Zhang, Yundong] State Key Lab of Digital Multimedia Chip Technology, Vimicro Corp, Beijing 100191, China

Corresponding author:

  • [Zhang, Zhaoxiang] Center for Research on Intelligent Perception and Computing, National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China

Year: 2019

Pages: 3502-3509

Language: English

Citation counts:

WoS Core Collection citations: 0

ESI Highly Cited Papers listings: 0

Views in the last 30 days: 1
