Semi-supervised Video Object Segmentation Via an Edge Attention Gated Graph Convolutional Network

Author：

Zhang, Yuqing | Zhang, Yong (张勇) | Wang, Shaofan | Liang, Yun | Yin, Baocai

Indexed by：

EI Scopus SCIE

Abstract：

Video object segmentation (VOS) must cope with heavy occlusions, large deformations, and severe motion blur. While many remarkable convolutional neural networks have been devoted to the VOS task, they often misidentify background noise as the target or output coarse object boundaries, because they fail to mine detail information and high-order correlations of pixels across the whole video. In this work, we propose an edge attention gated graph convolutional network (GCN) for VOS. The seed point initialization and graph construction stages build a spatio-temporal graph of the video by exploring the spatial intra-frame correlation and the temporal inter-frame correlation of superpixels. The node classification stage identifies foreground superpixels using an edge attention gated GCN, which mines higher-order correlations between superpixels and propagates features among different nodes. The segmentation optimization stage refines the classification of foreground superpixels and reduces segmentation errors using a global appearance model that captures the long-term stable features of objects. In summary, the key contribution of our framework is twofold: (a) the spatio-temporal graph representation can propagate the seed points of the first frame to subsequent frames, which suits our framework to the semi-supervised VOS task; and (b) the edge attention gated GCN can learn the importance of each node with respect to both the neighboring nodes and the whole task with a small number of layers. Experiments on the DAVIS 2016 and DAVIS 2017 datasets show that our framework achieves excellent performance with only a small number of training samples (45 video sequences).

Keyword：

semi-supervised video object segmentation; graph convolutional network; superpixel; spatio-temporal graph model

Author Affiliations：

[ 1 ] [Zhang, Yuqing]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, 100 Pingleyuan, Beijing 100124, Peoples R China
[ 2 ] [Zhang, Yong]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, 100 Pingleyuan, Beijing 100124, Peoples R China
[ 3 ] [Wang, Shaofan]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, 100 Pingleyuan, Beijing 100124, Peoples R China
[ 4 ] [Yin, Baocai]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, 100 Pingleyuan, Beijing 100124, Peoples R China
[ 5 ] [Liang, Yun]South China Agr Univ, Guangzhou Key Lab Intelligent Agr, Coll Math & Informat, Guangzhou, Peoples R China

Reprint Author's Address：

[Zhang, Yong]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, 100 Pingleyuan, Beijing 100124, Peoples R China

Email：

yuqingz@emails.bjut.edu.cn |
zhangyong2010@bjut.edu.cn |
wangshaofan@bjut.edu.cn |
sdliangyun@163.com |
ybc@bjut.edu.cn



Source ：

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS

ISSN： 1551-6857

Year： 2024

Issue： 1

Volume： 20

Impact Factor： 5.100 (JCR@2022)

ESI Discipline： COMPUTER SCIENCE;

ESI HC Threshold：4

Cited Count：

WoS CC Cited Count： 1

SCOPUS Cited Count： 2

ESI Highly Cited Papers on the List： 0

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 1

Affiliated Colleges：
