Authors:

Yue, Xiao | Qu, Guangzhi | Liu, Bo (Scholar: 刘博) | Liu, Anyi

Indexed in:

CPCI-S

Abstract:

Sound source detection and localization have many practical uses in industrial settings. Most sound source direction detection algorithms in the literature are designed to identify the angle of a sound source in 2D space. In this work, we propose using convolutional neural networks to detect the sound source direction in 3D space. The algorithm is based on the generalized cross-correlation method with phase transform (GCC-PHAT) [1], which is used to derive the time delay of arrival (TDOA). By using a convolutional neural network model, this algorithm can be applied and deployed. In addition, by modifying the GCC-PHAT formula, the approach also works for detecting multiple sound sources. Simulation results for single-source and multiple-source detection show that the proposed system could work in most situations.
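
The abstract names GCC-PHAT as the basis for the delay estimate. As an illustration only, here is a minimal sketch of the standard GCC-PHAT delay estimate between two microphone signals. It is not the authors' code and does not include their modification for multiple sources or their network model. The function name `gcc_phat` and its parameters (`sig`, `ref`, `fs`, `max_tau`) are assumptions introduced for this example.

```python
import numpy as np

def gcc_phat(sig, ref, fs, max_tau=None):
    """Estimate the delay (in seconds) of `sig` relative to `ref` with GCC-PHAT.

    Hypothetical helper for illustration; not taken from the paper.
    """
    # Zero-pad so the circular correlation behaves like a linear one.
    n = sig.shape[0] + ref.shape[0]
    SIG = np.fft.rfft(sig, n=n)
    REF = np.fft.rfft(ref, n=n)

    # Cross-power spectrum, then the phase transform: discard the magnitude
    # and keep only the phase (small constant avoids division by zero).
    R = SIG * np.conj(REF)
    R /= np.abs(R) + 1e-12

    # Back to the lag domain.
    cc = np.fft.irfft(R, n=n)

    # Restrict the search to physically plausible lags if max_tau is given.
    max_shift = n // 2
    if max_tau is not None:
        max_shift = max(1, min(int(fs * max_tau), max_shift))

    # Reorder so index 0 corresponds to lag -max_shift.
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    shift = int(np.argmax(np.abs(cc))) - max_shift
    return shift / float(fs)
```

With this sign convention, a positive result means `sig` arrives later than `ref`. Delays estimated this way for several microphone pairs are the kind of TDOA features that a direction estimator can take as input.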

Keywords:

Convolutional Neural Network; GCC-PHAT; Room impulse simulation; Sound Source Direction Detection

Author affiliations:

  • [ 1 ] [Yue, Xiao]Oakland Univ, Comp Sci & Engn Dept, Rochester, MI 48063 USA
  • [ 2 ] [Qu, Guangzhi]Oakland Univ, Comp Sci & Engn Dept, Rochester, MI 48063 USA
  • [ 3 ] [Liu, Anyi]Oakland Univ, Comp Sci & Engn Dept, Rochester, MI 48063 USA
  • [ 4 ] [Liu, Bo]Beijing Univ Technol, Sch Software Engn, Fac Informat Technol, Beijing, Peoples R China

Corresponding author:

  • [Yue, Xiao]Oakland Univ, Comp Sci & Engn Dept, Rochester, MI 48063 USA

Source:

2018 FIRST IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE FOR INDUSTRIES (AI4I 2018)

Year: 2018

Pages: 81-84

Language: English

Times cited:

WoS Core Collection citations: 4

ESI Highly Cited Papers listings: 0
