Semantic Segmentation of Large-Size VHR Remote Sensing Images Using a Two-Stage Multiscale Training Architecture - Details

Author：

Ding, Lei (Ding, Lei.) | Zhang, Jing (Zhang, Jing.) | Bruzzone, Lorenzo (Bruzzone, Lorenzo.)

Indexed by：

EI Scopus SCIE

Abstract：

Very-high　resolution　(VHR)　remote　sensing　images　(RSIs)　have　significantly　larger　spatial　size　compared　to　typical　natural　images　used　in　computer　vision　applications.　Therefore,　it　is　computationally　unaffordable　to　train　and　test　classifiers　on　these　images　at　a　full-size　scale.　Commonly　used　methodologies　for　semantic　segmentation　of　RSIs　perform　training　and　prediction　on　cropped　image　patches.　Thus,　they　have　the　limitation　of　failing　to　incorporate　enough　context　information.　In　order　to　better　exploit　the　correlations　between　ground　objects,　we　propose　a　deep　architecture　with　a　two-stage　multiscale　training　strategy　that　is　tailored　to　the　semantic　segmentation　of　large-size　VHR　RSIs.　In　the　first　stage　of　the　training　strategy,　a　semantic　embedding　network　is　designed　to　learn　high-level　features　from　downscaled　images　covering　a　large　area.　In　the　second　training　stage,　a　local　feature　extraction　network　is　designed　to　introduce　low-level　information　from　cropped　image　patches.　The　resulting　training　strategy　is　able　to　fuse　complementary　information　learned　from　multiple　levels　to　make　predictions.　Experimental　results　on　two　data　sets　show　that　it　outperforms　local-patch-based　training　models　in　terms　of　both　accuracy　and　stability.

Keyword：

semantic segmentation remote sensing deep learning Convolutional neural network

Author Community：

[ 1 ] [Ding, Lei]Univ Trento, Dept Informat Engn & Comp Sci, I-38123 Trento, Italy
[ 2 ] [Bruzzone, Lorenzo]Univ Trento, Dept Informat Engn & Comp Sci, I-38123 Trento, Italy
[ 3 ] [Zhang, Jing]Beijing Univ Technol, Dept Informat, Beijing 100124, Peoples R China

Reprint Author's Address：

[Bruzzone, Lorenzo]Univ Trento, Dept Informat Engn & Comp Sci, I-38123 Trento, Italy

Email：

lei.ding@unitn.it |
zhangjing@emails.bjut.edu.cn |
lorenzo.btuzzone@ing.unitn.it

Show more details

Related Keywords：

Multi-Scale Context Aggregation for Semantic Segmentation of Remote Sensing Images
2020，REMOTE SENSING
Looking Outside the Window: Wide-Context Transformer for the Semantic Segmentation of High-Resolution Remote Sensing Images
2022，IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING
When zero-padding position encoding encounters linear space reduction attention: an efficient semantic segmentation Transformer of remote sensing images
2024，INTERNATIONAL JOURNAL OF REMOTE SENSING
MA-Unet:An improved version of Unet based on multi-scale and attention mechanism for medical image segmentation
2022，THIRD INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION; NETWORK AND COMPUTER TECHNOLOGY (ECNCT 2021)

Source ：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING

ISSN： 0196-2892

Year： 2020

Issue： 8

Volume： 58

Page： 5367-5376

8 . 2 0 0

JCR@2022

ESI Discipline： GEOSCIENCES;

ESI HC Threshold：99

Cited Count：

WoS CC Cited Count： 114

SCOPUS Cited Count： 135

ESI Highly Cited Papers on the List： 4 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 1

Affiliated Colleges：

理学部

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to