• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Deng, Feng (Deng, Feng.) | Bao, Changchun (Bao, Changchun.) (学者:鲍长春) | Kleijn, W. Bastiaan (Kleijn, W. Bastiaan.)

收录:

EI Scopus SCIE

摘要:

We propose a sparse hidden Markov model (HMM)-based single- channel speech enhancement method that models the speech and noise gains accurately in non- stationary noise environments. Autoregressive models are employed to describe the speech and noise in a unified framework and the speech and noise gains are modeled as random processes with memory. The likelihood criterion for finding the model parameters is augmented with an l(p) regularization term resulting in a sparse autoregressive HMM (SARHMM) system that encourages sparsity in the speech- and noise- modeling. In the SARHMM only a small number of HMM states contribute significantly to the model of each particular observed speech segment. As it eliminates ambiguity between noise and speech spectra, the sparsity of speech and noise modeling helps to improve the tracking of the changes of both spectral shapes and power levels of non-stationary noise. Using the modeled speech and noise SARHMMs, we first construct a noise estimator to estimate the noise power spectrum. Then, a Bayesian speech estimator is derived to obtain the enhanced speech signal. The subjective and objective test results indicate that the proposed speech enhancement scheme can achieve a larger segmental SNR improvement, a lower log- spectral distortion and a better speech quality in stationary noise conditions than state-of-the-art reference methods. The advantage of the new method is largest for non-stationary noise conditions.

关键词:

non-stationary noise speech enhancement sparse autoregressive hidden Markov model (ARHMM) Gain modeling

作者机构:

  • [ 1 ] [Deng, Feng]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
  • [ 2 ] [Bao, Changchun]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
  • [ 3 ] [Kleijn, W. Bastiaan]Victoria Univ Wellington, Sch Engn & Comp Sci, Commun & Signal Proc Grp, Wellington 6140, New Zealand

通讯作者信息:

  • [Deng, Feng]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

查看成果更多字段

相关关键词:

相关文章:

来源 :

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING

ISSN: 2329-9290

年份: 2015

期: 11

卷: 23

页码: 1973-1987

5 . 4 0 0

JCR@2022

ESI学科: ENGINEERING;

ESI高被引阀值:174

JCR分区:2

中科院分区:2

被引次数:

WoS核心集被引频次: 27

SCOPUS被引频次: 33

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 0

归属院系:

在线人数/总访问数:617/3901110
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司