Author:

Liu, Hai | Zhang, Cheng | Deng, Yongjian | Liu, Tingting | Zhang, Zhaoli | Li, You-Fu

Indexed by:

EI SCIE

Abstract:

Head pose estimation (HPE) is an indispensable upstream task in human-machine interaction, self-driving, and attention detection. However, practical head pose applications face several challenges, such as severe occlusion, low illumination, and extreme orientations. To address these challenges, we identify three cues from head images: critical minority relationships, neighborhood orientation relationships, and significant facial changes. On the basis of these three cues, two key insights on head poses are revealed: 1) the intra-orientation relationship and 2) the cross-orientation relationship. To leverage these two insights, a novel relationship-driven method is proposed based on the Transformer architecture, in which facial and orientation relationships can be learned. Specifically, we design several orientation tokens to explicitly encode basic orientation regions. In addition, a novel token guide multi-loss function is designed to guide the orientation tokens as they learn the desired regional similarities and relationships. Experimental results on three challenging benchmark HPE datasets show that the proposed method, TokenHPE, achieves state-of-the-art performance. Moreover, qualitative visualizations are provided to verify the effectiveness of the token-learning methodology.

Keyword:

Task analysis; Pose estimation; deep learning; Computer architecture; attention mechanism; Transformers; Visualization; Semantics; Head; Head pose estimation; relationship perception; transformer

Author Community:

  • [ 1 ] [Liu, Hai]Cent China Normal Univ, Natl Engn Res Ctr Elearning, Wuhan 430079, Peoples R China
  • [ 2 ] [Zhang, Cheng]Cent China Normal Univ, Natl Engn Res Ctr Elearning, Wuhan 430079, Peoples R China
  • [ 3 ] [Zhang, Zhaoli]Cent China Normal Univ, Natl Engn Res Ctr Elearning, Wuhan 430079, Peoples R China
  • [ 4 ] [Deng, Yongjian]Beijing Univ Technol, Coll Comp Sci, Beijing 100124, Peoples R China
  • [ 5 ] [Liu, Tingting]Hubei Univ, Sch Educ, Wuhan 430062, Hubei, Peoples R China
  • [ 6 ] [Liu, Tingting]City Univ Hong Kong, Dept Mech Engn, Hong Kong, Peoples R China
  • [ 7 ] [Li, You-Fu]City Univ Hong Kong, Dept Mech Engn, Hong Kong, Peoples R China

Source:

IEEE TRANSACTIONS ON IMAGE PROCESSING

ISSN: 1057-7149

Year: 2023

Volume: 32

Page: 6289-6302

Impact Factor: 10.600 (JCR@2022)

Cited Count:

SCOPUS Cited Count: 103

ESI Highly Cited Papers on the List: 2

  • 2024-11
  • 2024-11
