A Spatial Relationship Preserving Adversarial Network for 3D Reconstruction from a Single Depth View - Details

Author：

Liu, Caixia (Liu, Caixia.) | Kong, Dehui (Kong, Dehui.) | Wang, Shaofan (Wang, Shaofan.) | Li, Jinghua (Li, Jinghua.) | Yin, Baocai (Yin, Baocai.) (Scholars：尹宝才)

Indexed by：

EI Scopus SCIE

Abstract：

Recovering　the　geometry　of　an　object　from　a　single　depth　image　is　an　interesting　yet　challenging　problem.　While　previous　learning　based　approaches　have　demonstrated　promising　performance,　they　don＇t　fully　explore　spatial　relationships　of　objects,　which　leads　to　unfaithful　and　incomplete　3D　reconstruction.　To　address　these　issues,　we　propose　a　Spatial　Relationship　Preserving　Adversarial　Network　(SRPAN)　consisting　of　3D　Capsule　Attention　Generative　Adversarial　Network　(3DCAGAN)　and　2D　Generative　Adversarial　Network　(2DGAN)　for　coarse-to-fine　3D　reconstruction　from　a　single　depth　view　of　an　object.　Firstly,　3DCAGAN　predicts　the　coarse　geometry　using　an　encoder-decoder　based　generator　and　a　discriminator.　The　generator　encodes　the　input　as　latent　capsules　represented　as　stacked　activity　vectors　with　local-to-global　relationships　(i.e.,　the　contribution　of　components　to　the　whole　shape),　and　then　decodes　the　capsules　by　modeling　local-to-local　relationships　(i.e.,　the　relationships　among　components)　in　an　attention　mechanism.　Afterwards,　2DGAN　refines　the　local　geometry　slice-by-slice,　by　using　a　generator　learning　a　global　structure　prior　as　guidance,　and　stacked　discriminators　enforcing　local　geometric　constraints.　Experimental　results　show　that　SRPAN　not　only　outperforms　several　state-of-the-art　methods　by　a　large　margin　on　both　synthetic　datasets　and　real-world　datasets,　but　also　reconstructs　unseen　object　categories　with　a　higher　accuracy.

Keyword：

latent capsule 3D reconstruction a single depth view self-attention

Author Community：

[ 1 ] [Liu, Caixia]Beijing Univ Technol, Fac Informat Technol, Beijing Inst Artificial Intelligence, Beijing Key Lab Multimedia & Intelligent Software, 100 Pingleyuan, Beijing 100124, Peoples R China
[ 2 ] [Kong, Dehui]Beijing Univ Technol, Fac Informat Technol, Beijing Inst Artificial Intelligence, Beijing Key Lab Multimedia & Intelligent Software, 100 Pingleyuan, Beijing 100124, Peoples R China
[ 3 ] [Wang, Shaofan]Beijing Univ Technol, Fac Informat Technol, Beijing Inst Artificial Intelligence, Beijing Key Lab Multimedia & Intelligent Software, 100 Pingleyuan, Beijing 100124, Peoples R China
[ 4 ] [Li, Jinghua]Beijing Univ Technol, Fac Informat Technol, Beijing Inst Artificial Intelligence, Beijing Key Lab Multimedia & Intelligent Software, 100 Pingleyuan, Beijing 100124, Peoples R China
[ 5 ] [Yin, Baocai]Beijing Univ Technol, Fac Informat Technol, Beijing Inst Artificial Intelligence, Beijing Key Lab Multimedia & Intelligent Software, 100 Pingleyuan, Beijing 100124, Peoples R China

Reprint Author's Address：

Email：

lcxxib@emails.bjut.edu.cn |
kdh@bjut.edu.cn |
wangshaofan@bjut.edu.cn |
lijinghua@bjut.edu.cn |
ybc@bjut.edu.cn

Show more details

Related Keywords：

PlaneAC: Line-guided planar 3D reconstruction based on self-attention and convolution hybrid model
2024，PATTERN RECOGNITION
Multi-scale latent feature-aware network for logical partition based 3D voxel reconstruction
2023，NEUROCOMPUTING
Latent Feature-Aware and Local Structure-Preserving Network for 3D Completion from a Single Depth View
2021，30th International Conference on Artificial Neural Networks (ICANN)
SAQENet: A Quality Enhancement Network for Compressed Video with Self-attention
2022，DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC)

Source ：

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS

ISSN： 1551-6857

Year： 2022

Issue： 4

Volume： 18

5 . 1

JCR@2022

5 . 1 0 0

JCR@2022

ESI Discipline： COMPUTER SCIENCE;

ESI HC Threshold：46

JCR Journal Grade：1

CAS Journal Grade：3

Cited Count：

WoS CC Cited Count： 3

SCOPUS Cited Count： 11

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 0

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to