• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Zhang, Wenbo (Zhang, Wenbo.) | Wu, Chaoyi (Wu, Chaoyi.) | Bao, Zhenshan (Bao, Zhenshan.)

收录:

SCIE

摘要:

The sustainable development of marine fisheries depends on the accurate measurement of data on fish stocks. Semantic segmentation methods based on deep learning can be applied to automatically obtain segmentation masks of fish in images to obtain measurement data. However, general semantic segmentation methods cannot accurately segment fish objects in underwater images. In this study, a Dual Pooling-aggregated Attention Network (DPANet) to adaptively capture long-range dependencies through an efficient and computing-friendly manner to enhance feature representation and improve segmentation performance is proposed. Specifically, a novel pooling-aggregate position attention module and a pooling-aggregate channel attention module are designed to aggregate contexts in the spatial dimension and channel dimension, respectively. These two modules adopt pooling operations along the channel dimension and along the spatial dimension to aggregate information, respectively, thus reducing computational costs. In these modules, attention maps are generated by four different paths and are aggregated into one. The authors conduct extensive experiments to validate the effectiveness of the DPANet and achieve new state-of-the-art segmentation performance on the well-known fish image dataset DeepFish as well as on the underwater image dataset SUIM, achieving a Mean IoU score of 91.08% and 85.39% respectively, while significantly reducing FLOPs of attention modules by about 93%.

关键词:

computer vision convolutional neural nets image segmentation learning (artificial intelligence)

作者机构:

  • [ 1 ] [Zhang, Wenbo]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 2 ] [Wu, Chaoyi]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 3 ] [Bao, Zhenshan]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China

通讯作者信息:

  • [Zhang, Wenbo]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China

电子邮件地址:

查看成果更多字段

相关关键词:

来源 :

IET COMPUTER VISION

ISSN: 1751-9632

年份: 2021

1 . 7 0 0

JCR@2022

ESI学科: COMPUTER SCIENCE;

ESI高被引阀值:11

被引次数:

WoS核心集被引频次: 8

SCOPUS被引频次: 14

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 2

归属院系:

在线人数/总访问数:756/2897570
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司