Authors:

Gadosey, Pius Kwao | Li, Yuijan | Zhang, Ting | Liu, Zhaoying | Too, Edna Chebet | Essaf, Firdaous

Indexed in:

EI

Abstract:

As a research area of computer vision and deep learning, scene understanding has attracted a lot of attention in recent years. One major challenge is achieving high segmentation accuracy while containing the computational cost and time of training or inference. Most current algorithms trade one metric for the other depending on the target device. To address this problem, this paper proposes a novel deep neural network architecture called Segmentation Efficient Blocks Network (SEB-Net), which seeks the best possible balance between accuracy, computational cost, and real-time inference speed. The model is composed of an encoder path and a decoder path in a symmetric structure. The encoder path consists of 16 convolution layers identical to those of a VGG-19 model, and the decoder path includes what we call E-blocks (Efficient Blocks), inspired by the bottleneck module of the widely used ENet architecture, with slight modifications. One advantage of this model is that the max-unpooling in the decoder path is employed for expansion and projection convolutions in the E-blocks, allowing for fewer learnable parameters and efficient computation (10.1 frames per second (fps) for a 480x320 input, 11x fewer parameters than DeconvNet, and 52.4 GFLOPs for a 640x360 input on a Tesla K40 GPU). Experimental results on two outdoor scene datasets, the Cambridge-driving Labeled Video Database (CamVid) and Cityscapes, indicate that SEB-Net achieves higher performance than Fully Convolutional Networks (FCN), SegNet, DeepLabV, and Dilation8 in most cases. Moreover, SEB-Net outperforms efficient architectures such as ENet and LinkNet by 16.1 and 11.6, respectively, in terms of instance-level intersection over union (iIoU). SEB-Net also shows better performance when further evaluated on SUNRGB-D, an indoor scene dataset. © 2020 ACM.
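
The abstract credits max-unpooling in the decoder with reducing the number of learnable parameters. The sketch below is not the SEB-Net implementation; it is a minimal pure-Python illustration of the general technique, with hypothetical function names (`max_pool_2x2`, `max_unpool_2x2`) and a made-up 4x4 feature map. Pooling records where each maximum came from, and unpooling writes the pooled values back to those positions, filling the rest with zeros. No weights are involved in either step, which is why this kind of upsampling adds no parameters, unlike a learned deconvolution.

```python
# Illustrative sketch only: 2x2 max-pooling that remembers argmax positions,
# and the matching max-unpooling. Names and data are hypothetical.

def max_pool_2x2(x):
    """Pool a 2D list (even height and width) with a 2x2 window, stride 2.

    Returns (pooled, indices), where indices[i][j] is the (row, col)
    position in x of the maximum selected for output cell (i, j).
    """
    h, w = len(x), len(x[0])
    pooled, indices = [], []
    for i in range(0, h, 2):
        pooled_row, index_row = [], []
        for j in range(0, w, 2):
            window = [(x[r][c], (r, c)) for r in (i, i + 1) for c in (j, j + 1)]
            value, pos = max(window, key=lambda t: t[0])
            pooled_row.append(value)
            index_row.append(pos)
        pooled.append(pooled_row)
        indices.append(index_row)
    return pooled, indices


def max_unpool_2x2(pooled, indices, out_h, out_w):
    """Write each pooled value back to its recorded position; zeros elsewhere."""
    out = [[0] * out_w for _ in range(out_h)]
    for pooled_row, index_row in zip(pooled, indices):
        for value, (r, c) in zip(pooled_row, index_row):
            out[r][c] = value
    return out


# Made-up 4x4 feature map for demonstration.
feature_map = [
    [1, 3, 2, 0],
    [4, 2, 1, 5],
    [0, 1, 7, 2],
    [6, 2, 3, 4],
]
pooled, indices = max_pool_2x2(feature_map)
restored = max_unpool_2x2(pooled, indices, 4, 4)
print(pooled)    # [[4, 5], [6, 7]]
print(restored)  # [[0, 0, 0, 0], [4, 0, 0, 5], [0, 0, 7, 0], [6, 0, 0, 0]]
```

The restored map is sparse, so encoder-decoder networks of this family typically follow unpooling with convolutions that densify it. Deep learning frameworks offer equivalent built-in operations, for example PyTorch's `MaxPool2d(return_indices=True)` paired with `MaxUnpool2d`.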

Keywords:

Convolution; Convolutional neural networks; Decoding; Deep learning; Deep neural networks; Network architecture; Network coding

Author affiliations:

  • [ 1 ] [Gadosey, Pius Kwao] Beijing University of Technology, Beijing; 100124, China
  • [ 2 ] [Li, Yuijan] School of Artificial Intelligence, Guilin University of Electronic Technology, Guilin; 541004, China
  • [ 3 ] [Zhang, Ting] Beijing University of Technology, Beijing; 100124, China
  • [ 4 ] [Liu, Zhaoying] Beijing University of Technology, Beijing; 100124, China
  • [ 5 ] [Too, Edna Chebet] Department of Computer Science and ICT, Chuka University, Kenya
  • [ 6 ] [Essaf, Firdaous] Beijing University of Technology, Beijing; 100124, China

Corresponding author:

  • [Li, Yuijan] School of Artificial Intelligence, Guilin University of Electronic Technology, Guilin; 541004, China

Year: 2020

Pages: 542-551

Language: English

Citation counts:

Web of Science Core Collection citations: 0

Scopus citations: 1

ESI Highly Cited Papers listings: 0
