Multiple-Speech-Source DOA Estimation Based on Single-Source Cluster Detection

Author：

Li, Lu | Jia, Maoshen | Wang, Jing | Cao, Ruiyuan

Indexed by：

EI Scopus SCIE

Abstract：

This study proposes multiple-speech-source direction-of-arrival (DOA) estimation based on the distribution characteristic of the time-frequency (TF) point dominated by a single-source component (i.e., single-source point, SSP). By exploring the TF distribution characteristics of SSPs, we found that most are distributed in clusters in the TF domain. Hence, the concept of a single-source cluster (SSC) is given, each composed of adjacent TF points from one dominant sound source. Considering that SSCs have different shapes and sizes, an SSC detection method is designed based on point-to-cluster expansion, which is the research focus of this article. A two-dimensional Gaussian function is introduced to model the theoretical distribution of the DOAs of SSPs, and a cluster expansion rule is proposed based on hypothesis testing of the DOA of a source. Two-dimensional kernel density estimation and peak search are adopted to estimate the DOAs and the number of sources using the detected SSCs. Experimental results in both simulated and real environments show that the proposed method can achieve better DOA estimation performance than some current techniques.
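
The last estimation stage named in the abstract (two-dimensional kernel density estimation followed by a peak search) can be illustrated with a minimal sketch. This is not the authors' implementation: the SSC detection step is skipped entirely, the per-point DOA estimates are synthetic, and the function names kde_2d and find_peaks, the 5-degree grid, the bandwidth, and the relative threshold are all hypothetical choices made for illustration. Distances are taken directly in (azimuth, elevation) degrees, which ignores spherical geometry.

```python
import math
import random


def kde_2d(points, az_grid, el_grid, bandwidth):
    """Gaussian kernel density on an (azimuth, elevation) grid, in degrees."""
    density = [[0.0] * len(el_grid) for _ in az_grid]
    norm = 1.0 / (2.0 * math.pi * bandwidth ** 2 * len(points))
    for i, az in enumerate(az_grid):
        for j, el in enumerate(el_grid):
            total = 0.0
            for p_az, p_el in points:
                # Wrap the azimuth difference into [-180, 180).
                d_az = (az - p_az + 180.0) % 360.0 - 180.0
                d_el = el - p_el
                total += math.exp(-(d_az ** 2 + d_el ** 2) / (2.0 * bandwidth ** 2))
            density[i][j] = norm * total
    return density


def find_peaks(density, az_grid, el_grid, rel_threshold):
    """Grid points that are strict local maxima above rel_threshold * global max."""
    peak_floor = rel_threshold * max(max(row) for row in density)
    n_az, n_el = len(az_grid), len(el_grid)
    peaks = []
    for i in range(n_az):
        for j in range(n_el):
            value = density[i][j]
            if value < peak_floor:
                continue
            # Azimuth is circular (index wraps); elevation is bounded.
            neighbours = [
                density[(i + di) % n_az][j + dj]
                for di in (-1, 0, 1)
                for dj in (-1, 0, 1)
                if (di, dj) != (0, 0) and 0 <= j + dj < n_el
            ]
            if all(value > n for n in neighbours):
                peaks.append((az_grid[i], el_grid[j]))
    return peaks


# Synthetic stand-in for DOA estimates taken from detected clusters (assumption):
# two sources, 300 noisy (azimuth, elevation) estimates each.
random.seed(0)
true_doas = [(60.0, 30.0), (200.0, 45.0)]
points = [
    (random.gauss(az, 4.0), random.gauss(el, 4.0))
    for az, el in true_doas
    for _ in range(300)
]

az_grid = [5.0 * k for k in range(72)]  # 0 to 355 degrees
el_grid = [5.0 * k for k in range(19)]  # 0 to 90 degrees
density = kde_2d(points, az_grid, el_grid, bandwidth=6.0)
peaks = find_peaks(density, az_grid, el_grid, rel_threshold=0.5)

print("estimated source count:", len(peaks))
print("estimated DOAs (azimuth, elevation):", peaks)
```

In this sketch the number of peaks that survive the threshold serves as the source-count estimate and the peak locations serve as the DOA estimates. Which thresholding rule and bandwidth the paper actually uses is not stated in the abstract.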

Keyword：

Estimation | Location awareness | hypothesis testing | Reflection | DOA estimation | single-source cluster detection | Direction-of-arrival estimation | Recording | Microphone arrays | Reverberation

Author Affiliations：

[ 1 ] [Li, Lu]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 2 ] [Jia, Maoshen]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 3 ] [Wang, Jing]Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
[ 4 ] [Cao, Ruiyuan]Beijing Univ Technol, Fac Sci, Beijing 100124, Peoples R China

Email：

lilubjut@163.com
jiamaoshen@bjut.edu.cn
wangjing@bit.edu.cn
caoruiyuan@bjut.edu.cn

Related Articles：

Multi-Source DOA Estimation in Reverberant Environments by Jointing Detection and Modeling of Time-Frequency Points
2021, IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING
Joint DOA Estimation and Dereverberation Based on Multi-Channel Linear Prediction Filtering and Azimuth Sparsity
2024, IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING
Multi-Source Localization Using Optimized Time-Frequency Representation and Sparsity Component Analysis
2023, IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING
Multi-source DOA estimation in reverberant environments using potential single-source points enhancement
2021, APPLIED ACOUSTICS

Source ：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING

ISSN： 2329-9290

Year： 2023

Volume： 31

Page： 3667-3680

Impact Factor： 5.400 (JCR@2022)

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count： 1

ESI Highly Cited Papers on the List： 0
