• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Zhu, Shasha (Zhu, Shasha.) | Sun, Lu (Sun, Lu.) | Ma, Zeyuan (Ma, Zeyuan.) | Li, Chenxi (Li, Chenxi.) | He, Dongzhi (He, Dongzhi.)

收录:

EI Scopus SCIE

摘要:

Skeleton-based action recognition is a core task in the field of video understanding. Skeleton sequences are characterized by high information density, low redundancy, and clear structural information, thereby facilitating the analysis of complex relationships among human behaviors more readily than other modalities. Although existing studies have encoded skeleton data and achieved positive outcomes, they have often overlooked the precise high-level semantic information inherent in the action descriptions. To address this issue, this paper proposes a prompt-supervised dynamic attention graph convolutional network (PDA-GCN). Specifically, the PDA-GCN incorporates a prompt supervision (PS) module that leverages a pre-trained large-scale language model (LLM) as a knowledge engine and retains the generated text features as prompts to provide additional supervision during model training, enhancing the model's ability to discern analogous actions with negligible computational cost. In addition, for the purpose of bolstering the learning of discriminative features, a dynamic attention graph convolution (DA-GC) module is presented. This module utilizes self-attention mechanism to adaptively infer intrinsic relationships between joints and integrates dynamic convolution to strengthen the emphasis on local information. This dual focus on both global context and local details further amplifies the efficiency and effectiveness of the model. Extensive experiments, conducted on the widely-used skeleton-based action recognition datasets NTU RGB+D 60 and NTU RGB+D 120, demonstrate that the PDA-GCN surpasses known state-of-the-art methods, achieving accuracies of 93.4% on the NTU RGB+D 60 cross-subject split and 90.7% on the NTU RGB+D 120 cross-subject split.

关键词:

Attention mechanism Prompt learning Dynamic convolution Graph convolutional network Skeleton-based action recognition

作者机构:

  • [ 1 ] [Zhu, Shasha]Beijing Univ Technol, Coll Comp Sci, Beijing, Peoples R China
  • [ 2 ] [Sun, Lu]Beijing Univ Technol, Coll Comp Sci, Beijing, Peoples R China
  • [ 3 ] [Ma, Zeyuan]Beijing Univ Technol, Coll Comp Sci, Beijing, Peoples R China
  • [ 4 ] [Li, Chenxi]Beijing Univ Technol, Coll Comp Sci, Beijing, Peoples R China
  • [ 5 ] [He, Dongzhi]Beijing Univ Technol, Coll Comp Sci, Beijing, Peoples R China

通讯作者信息:

  • [Zhu, Shasha]Beijing Univ Technol, Coll Comp Sci, Beijing, Peoples R China;;

电子邮件地址:

查看成果更多字段

相关关键词:

来源 :

NEUROCOMPUTING

ISSN: 0925-2312

年份: 2024

卷: 611

6 . 0 0 0

JCR@2022

被引次数:

WoS核心集被引频次:

SCOPUS被引频次:

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 1

归属院系:

在线人数/总访问数:484/4967739
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司