• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Liang, Xi (Liang, Xi.) | Zhang, Jing (Zhang, Jing.) (学者:张菁) | Zhuo, Li (Zhuo, Li.) | Li, Yuzhao (Li, Yuzhao.) | Tian, Qi (Tian, Qi.)

收录:

EI SCIE

摘要:

Objects in unmanned aerial vehicle (UAV) images are generally small due to the high-photography altitude. Although many efforts have been made in object detection, how to accurately and quickly detect small objects is still one of the remaining open challenges. In this paper, we propose a feature fusion and scaling-based single shot detector (FS-SSD) for small object detection in the UAV images. The FS-SSD is an enhancement based on FSSD, a variety of the original single shot multibox detector (SSD). We add an extra scaling branch of the deconvolution module with an average pooling operation to form a feature pyramid. The original feature fusion branch is adjusted to be better suited to the small object detection task. The two feature pyramids generated by the deconvolution module and feature fusion module are utilized to make predictions together. In addition to the deep features learned by the FS-SSD, to further improve the detection accuracy, spatial context analysis is proposed to incorporate the object spatial relationships into object redetection. The interclass and intraclass distances between different object instances are computed as a spatial context, which proves effective for multiclass small object detection. Six experiments are conducted on the PASCAL VOC dataset and the two UAV image datasets. The experimental results demonstrate that the proposed method can achieve a comparable detection speed but an accuracy superior to those of the six state-of-the-art methods.

关键词:

Deconvolution Detectors Feature extraction feature fusion feature scaling Object detection Photography Remote sensing single shot detector small object detection spatial context analysis Unmanned aerial vehicles Unmanned aerial vehicle (UAV) image

作者机构:

  • [ 1 ] [Liang, Xi]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
  • [ 2 ] [Zhang, Jing]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
  • [ 3 ] [Zhuo, Li]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
  • [ 4 ] [Li, Yuzhao]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
  • [ 5 ] [Zhuo, Li]Collaborat Innovat Ctr Elect Vehicles Beijing, Beijing 100124, Peoples R China
  • [ 6 ] [Tian, Qi]Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA
  • [ 7 ] [Tian, Qi]Huawei Technol, Noahs Ark Lab, Shenzhen 518129, Peoples R China

通讯作者信息:

  • 张菁

    [Zhang, Jing]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China

查看成果更多字段

相关关键词:

相关文章:

来源 :

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY

ISSN: 1051-8215

年份: 2020

期: 6

卷: 30

页码: 1758-1770

8 . 4 0 0

JCR@2022

ESI学科: ENGINEERING;

ESI高被引阀值:28

JCR分区:1

被引次数:

WoS核心集被引频次: 132

SCOPUS被引频次: 162

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 3

在线人数/总访问数:6713/2953922
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司