• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Li, Ruwei (Li, Ruwei.) | Li, Tao (Li, Tao.) | Sun, Xiaoyue (Sun, Xiaoyue.) | Yang, Dengcai (Yang, Dengcai.) | Wang, Qi (Wang, Qi.)

收录:

EI PKU CSCD

摘要:

The performance of the existing target localization algorithms is not ideal in complex acoustic environment. In order to improve this problem, a novel target binaural sound localization algorithm is presented. First, the algorithm uses binaural spectral features as input of a time-frequency units selector based on deep learning. Then, to reduce the negative impact of the time-frequency unit belonging to noise on the localization accuracy, the selector is emploied to select the reliable time-frequency units from binaural input sound signal. At the same time, a Deep Neural Network (DNN)-based localization system maps the binaural cues of each time-frequency unit to the azimuth posterior probability. Finally, the target localization is completed according to the azimuth posterior probability belonging to the reliable time-frequency units. Experimental results show that the performance of the proposed algorithm is better than comparison algorithms and achieves a significant improvement in target localization accuracy in low Signal-to-Noise Ratio(SNR) and various reverberation environments, especially when there is noise similar to the target sound source. © 2019, Science Press. All right reserved.

关键词:

Acoustic generators Acoustic noise Deep learning Deep neural networks Signal to noise ratio

作者机构:

  • [ 1 ] [Li, Ruwei]Laboratory of Speech and Audio Signal Processing and Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
  • [ 2 ] [Li, Tao]Laboratory of Speech and Audio Signal Processing and Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
  • [ 3 ] [Sun, Xiaoyue]Laboratory of Speech and Audio Signal Processing and Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
  • [ 4 ] [Yang, Dengcai]Institute of Science and Technology Development, Beijing University of Technology, Beijing; 100124, China
  • [ 5 ] [Wang, Qi]Laboratory of Speech and Audio Signal Processing and Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China

通讯作者信息:

  • [li, ruwei]laboratory of speech and audio signal processing and institute of artificial intelligence, faculty of information technology, beijing university of technology, beijing; 100124, china

电子邮件地址:

查看成果更多字段

相关关键词:

相关文章:

来源 :

Journal of Electronics and Information Technology

ISSN: 1009-5896

年份: 2019

期: 12

卷: 41

页码: 2932-2938

被引次数:

WoS核心集被引频次: 0

SCOPUS被引频次:

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 2

归属院系:

在线人数/总访问数:152/3603730
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司