• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Pang, Junbiao (Pang, Junbiao.) (学者:庞俊彪) | Hu, Anjing (Hu, Anjing.) | Huang, Qingming (Huang, Qingming.) (学者:黄庆明) | Tian, Qi (Tian, Qi.) | Yin, Baocai (Yin, Baocai.) (学者:尹宝才)

收录:

EI Scopus SCIE

摘要:

Organizing webpages into interesting topics is one of the key steps to understand the trends from multimodal Web data. The sparse, noisy, and less-constrained user-generated content results in inefficient feature representations. These descriptors unavoidably cause that a detected topic still contains a certain number of the false detected webpages, which further make a topic be less coherent, less interpretable, and less useful. In this paper, we address this problem from a viewpoint interpreting a topic by its prototypes, and present a two-step approach to achieve this goal. Following the detection-by-ranking approach, a sparse Poisson deconvolution is proposed to learn the intratopic similarities between webpages. To find the prototypes, leveraging the intratopic similarities, top-k diverse yet representative prototype webpages are identified from a submodularity function. Experimental results not only show the improved accuracies for the Web topic detection task, but also increase the interpretation of a topic by its prototypes on two public datasets.

关键词:

Poisson deconvolution sparsity Web topic detection submodularity prototype learning (PL) topic interpretation

作者机构:

  • [ 1 ] [Pang, Junbiao]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
  • [ 2 ] [Hu, Anjing]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
  • [ 3 ] [Huang, Qingming]Univ Chinese Acad Sci, Chinese Acad Sci, Beijing 100049, Peoples R China
  • [ 4 ] [Huang, Qingming]Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
  • [ 5 ] [Tian, Qi]Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA
  • [ 6 ] [Yin, Baocai]Dalian Univ Technol, Adv Invocat Ctr Future Internet Technol, Dalian 116024, Peoples R China

通讯作者信息:

  • 庞俊彪 黄庆明

    [Pang, Junbiao]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China;;[Huang, Qingming]Univ Chinese Acad Sci, Chinese Acad Sci, Beijing 100049, Peoples R China

查看成果更多字段

相关关键词:

来源 :

IEEE TRANSACTIONS ON CYBERNETICS

ISSN: 2168-2267

年份: 2019

期: 3

卷: 49

页码: 1072-1083

1 1 . 8 0 0

JCR@2022

ESI学科: COMPUTER SCIENCE;

ESI高被引阀值:147

JCR分区:1

被引次数:

WoS核心集被引频次: 2

SCOPUS被引频次: 5

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 2

在线人数/总访问数:188/4608910
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司