• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Yao, Jiacheng (Yao, Jiacheng.) | Zhang, Jing (Zhang, Jing.) (学者:张菁) | Zhang, Hui (Zhang, Hui.) | Zhuo, Li (Zhuo, Li.)

收录:

EI Scopus SCIE

摘要:

With the rapid expansion of the we-media industry, streamers have increasingly incorporated inappropriate content into live videos to attract traffic and pursue interests. Blacklisted streamers often forge their identities or switch platforms to continue streaming, causing significant harm to the online environment. Consequently, streamer re-identification (re-ID) has become of paramount importance. Streamer biometrics in live videos exhibit multimodal characteristics, including voiceprints, faces, and spatiotemporal information, which complement each other. Therefore, we propose alight cross-modal attention network (LCMA-Net) for streamer re-ID in live videos. First, the voiceprint, face, and spatiotemporal features of the streamer are extracted by RawNetSA, Pi- Net, and STDA-ResNeXt3D, respectively. We then design alight cross-modal pooling attention (LCMPA) module, which, combined with a multilayer perceptron (MLP), aligns and concatenates different modality features into multimodal features within the LCMA-Net. Finally, the streamer is re-identified by measuring the similarity between these multimodal features. Five experiments were conducted on the StreamerReID dataset, and the results demonstrated that the proposed method achieved competitive performance. The dataset and code are available at https://github.com/BJUT-AIVBD/LCMA-Net.

关键词:

Live video Light cross-modal attention network Re-identification Light cross-modal pooling attention Streamer

作者机构:

  • [ 1 ] [Yao, Jiacheng]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing 100124, Peoples R China
  • [ 2 ] [Zhang, Jing]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing 100124, Peoples R China
  • [ 3 ] [Zhang, Hui]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing 100124, Peoples R China
  • [ 4 ] [Zhuo, Li]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing 100124, Peoples R China
  • [ 5 ] [Yao, Jiacheng]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
  • [ 6 ] [Zhang, Jing]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
  • [ 7 ] [Zhang, Hui]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
  • [ 8 ] [Zhuo, Li]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China

通讯作者信息:

  • [Zhang, Jing]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing 100124, Peoples R China;;

电子邮件地址:

查看成果更多字段

相关关键词:

来源 :

COMPUTER VISION AND IMAGE UNDERSTANDING

ISSN: 1077-3142

年份: 2024

卷: 249

4 . 5 0 0

JCR@2022

被引次数:

WoS核心集被引频次:

SCOPUS被引频次:

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 0

归属院系:

在线人数/总访问数:377/4954751
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司