• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Deng, Feng (Deng, Feng.) | Bao, Feng (Bao, Feng.) | Bao, Chang-chun (Bao, Chang-chun.) (学者:鲍长春)

收录:

EI Scopus SCIE

摘要:

In this paper, a single-channel speech enhancement method based on generalized weighted beta-order spectral amplitude estimator is proposed. First, we derive a new kind of generalized weighted beta-order Bayesian spectral amplitude estimator, which takes full advantage of both the traditional perceptually weighted estimators and beta-order spectral amplitude estimators and can obtain flexible and effective gain function. Second, according to the masking properties of human auditory system, the adaptive estimation methods for the perceptually weighted order p is proposed, which is based on a criterion that inaudible noise may be masked rather than removed. Thereby, the distortion of enhanced speech is reduced. Third, based on the compressive nonlinearity of the cochlea, the spectral amplitude order beta can be interpreted as the compression rate of the spectral amplitude, and then the adaptive calculation method of parameter beta is proposed. In addition, due to one frame delay, the a priori SNR estimation of decision-directed method in speech activity periods is inaccurate. In order to overcome the drawback, we present a new a priori SNR estimation method by combining predicted estimation with decision-directed rule. The subjective and objective test results indicate that the proposed Bayesian spectral amplitude estimator combined with the proposed a priori SNR estimation method can achieve a more significant segmental SNR improvement, a lower log-spectral distortion and a better speech quality over the reference methods. (C) 2014 Elsevier B.V. All rights reserved.

关键词:

Generalized weighed spectral amplitude estimator Speech enhancement Auditory masking properties A priori SNR estimation

作者机构:

  • [ 1 ] [Deng, Feng]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
  • [ 2 ] [Bao, Feng]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
  • [ 3 ] [Bao, Chang-chun]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

通讯作者信息:

  • 鲍长春

    [Bao, Chang-chun]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

电子邮件地址:

查看成果更多字段

相关关键词:

相关文章:

来源 :

SPEECH COMMUNICATION

ISSN: 0167-6393

年份: 2014

卷: 59

页码: 55-68

3 . 2 0 0

JCR@2022

ESI学科: COMPUTER SCIENCE;

ESI高被引阀值:188

JCR分区:2

中科院分区:3

被引次数:

WoS核心集被引频次: 14

SCOPUS被引频次: 16

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 4

归属院系:

在线人数/总访问数:629/4288512
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司