A lightweight network with attention decoder for real-time semantic segmentation - Details

Author：

Wang, Kang (Wang, Kang.) | Yang, Jinfu (Yang, Jinfu.) (Scholars：杨金福) | Yuan, Shuai (Yuan, Shuai.) | Li, Mingai (Li, Mingai.) (Scholars：李明爱)

Indexed by：

EI Scopus SCIE

Abstract：

As　an　important　task　in　scene　understanding,　semantic　segmentation　requires　a　large　amount　of　computation　to　achieve　high　performance.　In　recent　years,　with　the　rise　of　autonomous　systems,　it　is　crucial　to　make　a　trade-off　in　terms　of　accuracy　and　speed.　In　this　paper,　we　propose　a　novel　asymmetric　encoder-decoder　network　structure　to　address　this　problem.　In　the　encoder,　we　design　a　Separable　Asymmetric　Module,　which　combines　depth-wise　separable　asymmetric　convolution　with　dilated　convolution　to　greatly　reduce　computation　cost　while　maintaining　accuracy.　On　the　other　hand,　an　attention　mechanism　is　also　used　in　the　decoder　to　further　improve　segmentation　performance.　Experimental　results　on　CityScapes　and　CamVid　datasets　show　that　the　proposed　method　can　achieve　a　better　balance　between　segmentation　precision　and　speed　compared　with　state-of-the-art　semantic　segmentation　methods.　Specifically,　our　model　obtains　mean　IoU　of　72.5%　and　66.3%　on　CityScapes　and　CamVid　test　dataset,　respectively,　with　less　than　1M　parameters.

Keyword：

decoder structure Dilated convolution Depth-wise separable asymmetric convolution Semantic segmentation Attention mechanism Encoder&#8211

Author Community：

[ 1 ] [Wang, Kang]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 2 ] [Yang, Jinfu]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 3 ] [Yuan, Shuai]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 4 ] [Li, Mingai]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China

Reprint Author's Address：

[Wang, Kang]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China

Email：

201861365@emails.bjut.edu.cn

Show more details

Related Keywords：

Lw-TISNet: Light-Weight Convolutional Neural Network Incorporating Attention Mechanism and Multiple Supervision Strategy for Tongue Image Segmentation
2022，Sensing and Imaging
UTSN-net: Medical Image Semantic Segmentation Model Based On Skip Non-local Attention Module
2023，8th International Conference on Electronic Technology and Information Science, ICETIS 2023
A Semantic Segmentation Method with Emphasis on the Edges for Automatic Vessel Wall Analysis
2022，APPLIED SCIENCES-BASEL
EISNet: A Multi-Modal Fusion Network for Semantic Segmentation With Events and Images
2024，IEEE TRANSACTIONS ON MULTIMEDIA

Source ：

VISUAL COMPUTER

ISSN： 0178-2789

Year： 2021

Issue： 7

Volume： 38

Page： 2329-2339

3 . 5 0 0

JCR@2022

ESI Discipline： COMPUTER SCIENCE;

ESI HC Threshold：87

JCR Journal Grade：2

Cited Count：

WoS CC Cited Count： 12

SCOPUS Cited Count： 13

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 0

Affiliated Colleges：

信息学部

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to