Looking Outside the Window: Wide-Context Transformer for the Semantic Segmentation of High-Resolution Remote Sensing Images

Author：

Indexed by：

EI Scopus SCIE

Abstract：

Long-range contextual information is crucial for the semantic segmentation of high-resolution (HR) remote sensing images (RSIs). However, image cropping operations, commonly used for training neural networks, limit the perception of long-range contexts in large RSIs. To overcome this limitation, we propose a wide-context network (WiCoNet) for the semantic segmentation of HR RSIs. Apart from extracting local features with a conventional convolutional neural network (CNN), the WiCoNet has an extra context branch to aggregate information from a larger image area. Moreover, we introduce a context transformer to embed contextual information from the context branch and selectively project it onto the local features. The context transformer extends the vision transformer, an emerging type of neural network, to model the dual-branch semantic correlations. It overcomes the locality limitation of CNNs and enables the WiCoNet to see the bigger picture before segmenting the land-cover/land-use (LCLU) classes. Ablation studies and comparative experiments conducted on several benchmark datasets demonstrate the effectiveness of the proposed method. In addition, we present a new Beijing Land-Use (BLU) dataset. This is a large-scale HR satellite dataset with high-quality and fine-grained reference labels, which can facilitate future studies in this field.
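
The idea of selectively projecting context onto local features can be illustrated with a generic cross-attention step. The sketch below is not the WiCoNet implementation: it assumes single-head attention with no learned projection matrices, and the names `cross_attend`, `local_tokens`, and `context_tokens` are hypothetical. Each local feature vector acts as a query, the context-branch vectors act as keys and values, and the attended context is added back to the local feature.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attend(local_tokens, context_tokens):
    """Fuse context into local features with single-head cross-attention.

    Assumption (not from the paper): queries are the raw local tokens and
    keys/values are the raw context tokens, with no learned projections.
    """
    dim = len(local_tokens[0])
    fused = []
    for query in local_tokens:
        # Scaled dot-product similarity between this local token
        # and every context token.
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
            for key in context_tokens
        ]
        weights = softmax(scores)
        # Weighted sum of context tokens: the context selected for
        # this local position.
        attended = [
            sum(w * value[d] for w, value in zip(weights, context_tokens))
            for d in range(dim)
        ]
        # Residual connection keeps the original local feature.
        fused.append([q + a for q, a in zip(query, attended)])
    return fused


# Toy example: two local tokens, three context tokens, feature size 2.
local = [[1.0, 0.0], [0.0, 1.0]]
context = [[2.0, 0.0], [0.0, 2.0], [1.0, 1.0]]
fused = cross_attend(local, context)
```

In this toy setup each local token draws most heavily on the context token that points in the same direction, which is the sense in which the projection is selective. A practical model would add learned query, key, and value projections, multiple heads, and positional encodings.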

Keyword：

vision transformer (ViT); Task analysis; Transformers; semantic segmentation; Semantics; Context modeling; Convolutional neural networks; Feature extraction; Convolutional neural network; Image segmentation; remote sensing

Author Community：

[ 1 ] [Ding, Lei]PLA Strateg Force Informat Engn Univ, Zhengzhou 450001, Peoples R China
[ 2 ] [Lin, Dong]Space Engn Univ, Beijing 102249, Peoples R China
[ 3 ] [Lin, Dong]Xian Inst Surveying & Mapping, State Key Lab Geoinformat Engn, Xian 710054, Peoples R China
[ 4 ] [Lin, Shaofu]Beijing Univ Technol, Fac Informat Technol, Beijing 100022, Peoples R China
[ 5 ] [Zhang, Jing]Univ Trento, Dept Informat Engn & Comp Sci, I-38123 Trento, Italy
[ 6 ] [Bruzzone, Lorenzo]Univ Trento, Dept Informat Engn & Comp Sci, I-38123 Trento, Italy
[ 7 ] [Cui, Xiaojie]Beijing Inst Remote Sensing Informat, Beijing 100011, Peoples R China
[ 8 ] [Wang, Yuebin]China Univ Geosci Beijing, Sch Land Sci & Technol, Beijing 100084, Peoples R China
[ 9 ] [Tang, Hao]Swiss Fed Inst Technol, Dept Informat Technol & Elect Engn, CH-8092 Zurich, Switzerland


Related Keywords：

Distributed Learning based on Asynchronized Discriminator GAN for remote sensing image segmentation
2022，8th International Conference on Communication and Information Processing, ICCIP 2022
Multi-Scale Context Aggregation for Semantic Segmentation of Remote Sensing Images
2020，REMOTE SENSING
An effective road extraction method from remote sensing images based on self-adaptive threshold function
2019，2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019
General image segmentation by Deeper Residual U-Net
2019，4th International Conference on Mathematics and Artificial Intelligence, ICMAI 2019

Source ：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING

ISSN： 0196-2892

Year： 2022

Volume： 60

JCR@2022： 8.2

ESI Discipline： GEOSCIENCES;

ESI HC Threshold：38

JCR Journal Grade：1

CAS Journal Grade：1

Cited Count：

WoS CC Cited Count： 113

SCOPUS Cited Count： 113

ESI Highly Cited Papers on the List： 12

WanFang Cited Count：

Chinese Cited Count：

