首页>成果

  • 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

[期刊论文]

Multi-Source DOA Estimation in Reverberant Environments by Jointing Detection and Modeling of Time-Frequency Points

分享
编辑 删除 报错

作者:

Jia, Maoshen (Jia, Maoshen.) | Wu, Yuxuan (Wu, Yuxuan.) | Bao, Changchun (Bao, Changchun.) (学者:鲍长春) | 展开

收录:

EI Scopus SCIE

摘要:

In this article, the direction of arrival (DOA) estimation of multiple speech sources in reverberant environments is investigated based on the recording of a soundfield microphone. First, the recordings are analyzed in the time-frequency (T-F) domain to detect both 'points' (single T-F points) and 'regions' (multiple, adjacent T-F points) corresponding to a single source with low reverberation (known as low-reverberant-single-source (LRSS) points). Then, a LRSS point detection algorithm is proposed based on a joint dominance measure and instantaneous single-source point (SSP) identification. Following this, initial DOA estimates obtained for the detected LRSS points are analyzed using a Gaussian Mixture Model (GMM) derived by the Expectation-Maximization (EM) algorithm to cluster components into sources or outliers using a rule-based method. Finally, the DOA of each actual source is obtained from the estimated source components. Experiments on both simulated data and data recorded in an actual acoustic chamber demonstrate that the proposed algorithm exhibits improved performance for the DOA estimation in reverberant environments when compared to several existing approaches. © 2014 IEEE.

关键词:

Gaussian distribution Reverberation Frequency domain analysis Frequency estimation Clustering algorithms Maximum principle Direction of arrival Image segmentation Audio recordings

作者机构:

  • [ 1 ] [Jia, Maoshen]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
  • [ 2 ] [Wu, Yuxuan]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
  • [ 3 ] [Bao, Changchun]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
  • [ 4 ] [Ritz, Christian]School of Electrical Computer and Telecommunications Engineering, University of Wollongong, Wollongong; NSW; 2500, Australia

通讯作者信息:

  • [jia, maoshen]faculty of information technology, beijing university of technology, beijing; 100124, china

查看成果更多字段

相关文章:

来源 :

ACM Transactions on Audio Speech and Language Processing

ISSN: 2329-9290

年份: 2021

卷: 29

页码: 379-392

5 . 4 0 0

JCR@2022

ESI学科: ENGINEERING;

ESI高被引阀值:87

JCR分区:1

被引次数:

WoS核心集被引频次: 0

SCOPUS被引频次: 20

近30日浏览量: 1

归属院系:

在线人数/总访问数:155/4723143
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司