Multi-Source Localization Using Optimized Time-Frequency Representation and Sparsity Component Analysis - Details

Author：

Gao, Shang (Gao, Shang.) | Jia, Maoshen (Jia, Maoshen.) | Yao, Dingding (Yao, Dingding.) | Wang, Jing (Wang, Jing.)

Indexed by：

EI Scopus SCIE

Abstract：

This　article　aims　to　address　the　multi-source　localization　problem　by　exploiting　the　sparsity　of　the　speech　signal　in　the　time-frequency　domain,　where　the　challenge　mainly　lies　in　extracting　the　sparse　component.　An　optimized　time-frequency　representation　and　sparsity　component　analysis-based　multi-source　localization　method　is　proposed　to　overcome　this　challenge.　Firstly,　extracting　the　sparse　components　relies　on　the　accurate　representation　in　the　time-frequency　domain.　However,　the　energy　leakage　problem　caused　by　linear　time-frequency　transformation　limits　the　accuracy　of　sparse　component　extraction.　To　tackle　this　problem,　inspired　by　empirical　mode　decomposition,　the　proposed　method　classifies　all　the　points　in　the　time-frequency　domain　into　four　categories　based　on　their　phase　feature　and　mode　characteristics.　Each　type　of　the　point　is　modeled　separately,　and　a　point-by-point　analysis　is　conducted　to　remove　all　the　points　affected　by　energy　leakage.　Then,　based　on　the　optimized　time-frequency　representation,　the　phase　coherence　criterion　is　used　to　detect　the　sparse　component　in　the　point　level.　Following　that,　guided　by　the　mode　consistency　characteristic　of　sparse　components,　an　extension　scheme　is　proposed　to　recover　the　falsely　removed　sparse　components.　Finally,　the　detected　sparse　components　are　applied　for　the　multiple　source　localization.　The　objective　evaluation　is　performed　in　both　simulation　and　actual　recording　environments,　and　the　proposed　method　can　achieve　better　localization　accuracy　compared　to　several　existing　methods.

Keyword：

Location awareness Direction-of-arrival estimation time-frequency analysis Time-domain analysis sparsity Estimation Time-frequency analysis Microphone arrays Coherence DOA estimation

Author Community：

[ 1 ] [Gao, Shang]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 2 ] [Jia, Maoshen]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 3 ] [Yao, Dingding]Chinese Acad Sci, Inst Acoust, Beijing 100190, Peoples R China
[ 4 ] [Wang, Jing]Beijing Inst Technol, Dept Elect Engn, Beijing 100081, Peoples R China

Reprint Author's Address：

Email：

gaoshang9795@163.com |
jiamaoshen@bjut.edu.cn |
yaodingding@hccl.ioa.ac.cn |
wangjing@bit.edu.cn

Show more details

Related Keywords：

Joint DOA Estimation and Dereverberation Based on Multi-Channel Linear Prediction Filtering and Azimuth Sparsity
2024，IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING
Multiple-Speech-Source DOA Estimation Based on Single-Source Cluster Detection
2023，IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING
Multi-Source DOA Estimation in Reverberant Environments by Jointing Detection and Modeling of Time-Frequency Points
2021，IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING
Location-Aware Anti-Collision Protocol for Energy Efficient Passive RFID System
2014，International Conference on Indoor Positioning and Indoor Navigation (IPIN)

Source ：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING

ISSN： 2329-9290

Year： 2023

Volume： 31

Page： 3564-3578

5 . 4 0 0

JCR@2022

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count： 5

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 2

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to