• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Li, Lu (Li, Lu.) | Jia, Maoshen (Jia, Maoshen.) | Wang, Jing (Wang, Jing.) | Cao, Ruiyuan (Cao, Ruiyuan.)

收录:

EI Scopus SCIE

摘要:

This study proposes multiple-speech-source direction -of-arrival (DOA) estimation based on the distribution characteristic of the time-frequency (TF) point dominated by a single-source component (i.e., single-source point, SSP). By exploring the TF distribution characteristics of SSPs, we found that most are distributed in clusters in the TF domain. Hence, the concept of a single-source cluster (SSC) is given, each composed of adjacent TF points from one dominant sound source. Considering that SSCs have different shapes and sizes, an SSC detection method is designed based on point-to-cluster expansion, which is the research focus of this article. A two-dimensional Gaussian function is introduced to model the theoretical distribution of the DOAs of SSPs, and a cluster expansion rule is proposed based on hypothesis testing of the DOA of a source. Two-dimensional kernel density estimation and peak search are adopted to estimate the DOAs and the number of sources using the detected SSCs. Experimental results in both simulated and real environments show that the proposed method can achieve better DOA estimation performance than some current techniques.

关键词:

Estimation Location awareness hypothesis testing Reflection DOA estimation single-source cluster detection Direction-of-arrival estimation Recording Microphone arrays Reverberation

作者机构:

  • [ 1 ] [Li, Lu]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 2 ] [Jia, Maoshen]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 3 ] [Wang, Jing]Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
  • [ 4 ] [Cao, Ruiyuan]Beijing Univ Technol, Fac Sci, Beijing 100124, Peoples R China

通讯作者信息:

查看成果更多字段

相关关键词:

来源 :

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING

ISSN: 2329-9290

年份: 2023

卷: 31

页码: 3667-3680

5 . 4 0 0

JCR@2022

被引次数:

WoS核心集被引频次:

SCOPUS被引频次:

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 0

归属院系:

在线人数/总访问数:432/4876939
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司