• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Gao, Shang (Gao, Shang.) | Jia, Maoshen (Jia, Maoshen.) | Yao, Dingding (Yao, Dingding.) | Wang, Jing (Wang, Jing.)

收录:

EI Scopus SCIE

摘要:

This article aims to address the multi-source localization problem by exploiting the sparsity of the speech signal in the time-frequency domain, where the challenge mainly lies in extracting the sparse component. An optimized time-frequency representation and sparsity component analysis-based multi-source localization method is proposed to overcome this challenge. Firstly, extracting the sparse components relies on the accurate representation in the time-frequency domain. However, the energy leakage problem caused by linear time-frequency transformation limits the accuracy of sparse component extraction. To tackle this problem, inspired by empirical mode decomposition, the proposed method classifies all the points in the time-frequency domain into four categories based on their phase feature and mode characteristics. Each type of the point is modeled separately, and a point-by-point analysis is conducted to remove all the points affected by energy leakage. Then, based on the optimized time-frequency representation, the phase coherence criterion is used to detect the sparse component in the point level. Following that, guided by the mode consistency characteristic of sparse components, an extension scheme is proposed to recover the falsely removed sparse components. Finally, the detected sparse components are applied for the multiple source localization. The objective evaluation is performed in both simulation and actual recording environments, and the proposed method can achieve better localization accuracy compared to several existing methods.

关键词:

Location awareness Direction-of-arrival estimation time-frequency analysis Time-domain analysis sparsity Estimation Time-frequency analysis Microphone arrays Coherence DOA estimation

作者机构:

  • [ 1 ] [Gao, Shang]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 2 ] [Jia, Maoshen]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 3 ] [Yao, Dingding]Chinese Acad Sci, Inst Acoust, Beijing 100190, Peoples R China
  • [ 4 ] [Wang, Jing]Beijing Inst Technol, Dept Elect Engn, Beijing 100081, Peoples R China

通讯作者信息:

查看成果更多字段

相关关键词:

相关文章:

来源 :

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING

ISSN: 2329-9290

年份: 2023

卷: 31

页码: 3564-3578

5 . 4 0 0

JCR@2022

被引次数:

WoS核心集被引频次:

SCOPUS被引频次:

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 0

归属院系:

在线人数/总访问数:361/4896459
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司