Multi-scale Feature Injection for Occluded 3D Human Pose and Shape Estimation - Details

Author：

Shi, Yunhui (Shi, Yunhui.) (Scholars：施云惠) | Ge, Yangyang (Ge, Yangyang.) | Wang, Jin (Wang, Jin.)

Indexed by：

CPCI-S EI

Abstract：

3D　reconstruction　has　been　applied　to　many　research　fields　such　as　robots　and　computer　vision　with　the　fast　development　of　technology.　Despite　significant　progress,　current　3D　human　pose　and　shape　estimation　methods　still　remain　challenge　to　recovery　3D　human　mesh　under　occlusions.　Previous　works　use　a　Iterative　Error　Feedback　(IEF)　loop　to　construct　the　regressor　and　often　have　disregarded　information　at　occluded　regions　that　make　them　difficult　to　handle　occlusions.　However,　we　argue　that　occluded　regions　have　strong　correlations　with　human　body　so　that　they　can　offer　effective　information　for　3D　human　pose　and　shape　estimation.　To　address　this,　we　propose　a　multi-scale　feature　injection　network　MFINet,　that　utilizes　the　information　at　occluded　regions　as　a　secondary　clews　to　enrich　the　image　features　in　a　coarse-to-fine　manner.　In　MFInet,　given　the　image　feature　at　current　scale,　a　Transformer-based　module,　called　feature　inject　transformer　module　(FIM)　is　used　to　inject　human　feature　into　occluded　region　by　considering　their　correlation.　To　this　end,　experiments　show　that　our　method　is　effective　in　both　object　and　subject　results　on　several　benchmarks　including　Human3.6M,　3DPW,　LSP　and　COCO.

Keyword：

multi-scale 3D human reconstruction transformer

Author Community：

[ 1 ] [Shi, Yunhui]Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
[ 2 ] [Ge, Yangyang]Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
[ 3 ] [Wang, Jin]Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China

Reprint Author's Address：

Email：

syhzm@bjut.edu.cn |
geyy@emails.bjut.edu.cn |
ijinwang@bjut.edu.cn

Show more details

Related Keywords：

Feature Pyramid Based Scene Text Detector
2017，14th IAPR International Conference on Document Analysis and Recognition (ICDAR)
Tongue-coating image segmentation based on combination of morphological gradient and watershed algorithms
2011，IMAGING SCIENCE JOURNAL
Attention Multi-Scale Network for Automatic Layer Extraction of Ice Radar Topological Sequences
2021，REMOTE SENSING
MA-Unet:An improved version of Unet based on multi-scale and attention mechanism for medical image segmentation
2022，THIRD INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION; NETWORK AND COMPUTER TECHNOLOGY (ECNCT 2021)

Source ：

2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC

ISSN： 1948-9439

Year： 2023

Page： 4881-4886

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 1

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to