收录:
摘要:
In this paper, we propose a deep multimodal feature learning (DMFL) network for RGB-D salient object detection. The color and depth features are firstly extracted from low level to high level feature using CNN. Then the features at the high layer are shared and concatenated to construct joint feature representation of multi-modalities. The fused features are embedded to a high dimension metric space to express the salient and non-salient parts. And also a new objective function, consisting of cross-entropy and metric loss, is proposed to optimize the model. Both pixel and attribute level discriminative features are learned for semantical grouping to detect the salient objects. Experimental results show that the proposed model achieves promising performance and has about 1% to 2% improvement to conventional methods.
关键词:
通讯作者信息:
电子邮件地址:
来源 :
COMPUTERS & ELECTRICAL ENGINEERING
ISSN: 0045-7906
年份: 2021
卷: 92
4 . 3 0 0
JCR@2022
ESI学科: COMPUTER SCIENCE;
ESI高被引阀值:87
JCR分区:2
归属院系: