• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Zhang, Hongyu (Zhang, Hongyu.) | Wang, Suyu (Wang, Suyu.) | Zhang, Yingying (Zhang, Yingying.)

收录:

EI Scopus

摘要:

The encoder-decoder structure is the basic structure of most semantic segmentation models and is adopted by a large number of segmentation models. How to effectively extract image features and achieve high-precision mapping through the optimal design of encoder and decoder is the key issue of current research. SegFormer designs an encoder with excellent performance, which fully extracts the feature information of different semantic granularity in the image with a large receptive field. Even if a simple fully connected layer decoder is used, excellent segmentation results can also be achieved. However, this simplified decoder does not make full use of the advantages of the SegFormer encoder. Therefore, a decoder structure with dual-path multi-scale feature fusion is designed in this paper, and the decoder is redesigned according to the characteristics of the SegFormer encoder. The decoder adopts a dual-path structure, one path passes the abstract global information layer by layer to the local detail information through the layer-by-layer upsampling fusion module (LFM), and gradually upsamples the feature maps obtained from the encoder, and then use the channel fusion module to learn the importance of different channels in the deep abstract semantic feature map and the shallow local detail feature map, and perform dynamic fusion to obtain a feature map containing both abstract semantic information and local details. The other path takes advantage of the large receptive field of the feature map output by the SegFormer encoder, and uses the weighted hybrid multi-scale feature extraction module (WMF) to extract multi-scale features containing global semantics from the deep semantic feature map finally output by the encoder. Finally, the Deep Feature Fusion Module (DFM) is used to fuse the outputs of the first two modules, fully mining the multi-scale global information in the encoder, and obtaine the feature maps with rich semantic information, which effectively improves the algorithm model performance. © 2023 The Authors. Published by Elsevier B.V.

关键词:

Semantics Semantic Segmentation Channel coding Signal encoding Decoding

作者机构:

  • [ 1 ] [Zhang, Hongyu]Beijing University of Technology, Beijing; 100124, China
  • [ 2 ] [Wang, Suyu]Beijing University of Technology, Beijing; 100124, China
  • [ 3 ] [Zhang, Yingying]Shenyang University of Chemical Technology, Shenyang; 110142, China

通讯作者信息:

电子邮件地址:

查看成果更多字段

相关关键词:

相关文章:

来源 :

年份: 2023

卷: 222

页码: 157-166

语种: 英文

被引次数:

WoS核心集被引频次:

SCOPUS被引频次: 1

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 1

归属院系:

在线人数/总访问数:406/4878616
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司