A deep multimodal feature learning network for RGB-D salient object detection - Details

Author：

Liang, Fangfang (Liang, Fangfang.) | Duan, Lijuan (Duan, Lijuan.) (Scholars：段立娟) | Ma, Wei (Ma, Wei.) | Qiao, Yuanhua (Qiao, Yuanhua.) | Miao, Jun (Miao, Jun.)

Indexed by：

EI Scopus SCIE

Abstract：

In　this　paper,　we　propose　a　deep　multimodal　feature　learning　(DMFL)　network　for　RGB-D　salient　object　detection.　The　color　and　depth　features　are　firstly　extracted　from　low　level　to　high　level　feature　using　CNN.　Then　the　features　at　the　high　layer　are　shared　and　concatenated　to　construct　joint　feature　representation　of　multi-modalities.　The　fused　features　are　embedded　to　a　high　dimension　metric　space　to　express　the　salient　and　non-salient　parts.　And　also　a　new　objective　function,　consisting　of　cross-entropy　and　metric　loss,　is　proposed　to　optimize　the　model.　Both　pixel　and　attribute　level　discriminative　features　are　learned　for　semantical　grouping　to　detect　the　salient　objects.　Experimental　results　show　that　the　proposed　model　achieves　promising　performance　and　has　about　1%　to　2%　improvement　to　conventional　methods.

Keyword：

Salient object detection Metric space Multimodal feature learning RGB-D images

Author Community：

[ 1 ] [Liang, Fangfang]Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
[ 2 ] [Duan, Lijuan]Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
[ 3 ] [Ma, Wei]Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
[ 4 ] [Liang, Fangfang]Beijing Key Lab Trusted Comp, Beijing, Peoples R China
[ 5 ] [Duan, Lijuan]Beijing Key Lab Trusted Comp, Beijing, Peoples R China
[ 6 ] [Ma, Wei]Beijing Key Lab Trusted Comp, Beijing, Peoples R China
[ 7 ] [Liang, Fangfang]Natl Engn Lab Crit Technol Informat Secur Classif, Beijing, Peoples R China
[ 8 ] [Duan, Lijuan]Natl Engn Lab Crit Technol Informat Secur Classif, Beijing, Peoples R China
[ 9 ] [Ma, Wei]Natl Engn Lab Crit Technol Informat Secur Classif, Beijing, Peoples R China
[ 10 ] [Qiao, Yuanhua]Beijing Univ Technol, Coll Appl Sci, Beijing, Peoples R China
[ 11 ] [Miao, Jun]Beijing Informat Sci & Technol Univ, Sch Comp Sci, Beijing Key Lab Internet Culture & Digital Dissem, Beijing, Peoples R China

Reprint Author's Address：

段立娟
[Duan, Lijuan]Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China

Email：

ljduan@bjut.edu.cn

Show more details

Related Keywords：

Infrared salient object detection based on global guided lightweight non-local deep features
2021，INFRARED PHYSICS & TECHNOLOGY
Material based salient object detection from hyperspectral images
2018，PATTERN RECOGNITION
CoCNN: RGB-D deep fusion for stereoscopic salient object detection
2020，PATTERN RECOGNITION
Semantic Labeling of Indoor Scenes from RGB-D Images with Discriminative Learning
2013，6th International Conference on Machine Vision (ICMV)

Source ：

COMPUTERS & ELECTRICAL ENGINEERING

ISSN： 0045-7906

Year： 2021

Volume： 92

4 . 3 0 0

JCR@2022

ESI Discipline： COMPUTER SCIENCE;

ESI HC Threshold：87

JCR Journal Grade：2

Cited Count：

WoS CC Cited Count： 5

SCOPUS Cited Count： 5

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 1

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to