Indexed by:
Abstract:
A novel speech enhancement method based on Weighted Denoising Auto-encoder (WDA) and noise classification is proposed in this paper. A weighted reconstruction loss function is introduced into the conventional Denoising Auto-encoder (DA), and the relationship between the power spectra of clean speech and noisy observation is described by WDA model. First, the sub-band power spectrum of clean speech is estimated by WDA model from the noisy observation. Then, the a priori SNR is estimated by the a Posteriori SNR Controlled Recursive Averaging (PCRA) approach. Finally, the clean speech is obtained by Wiener filter in frequency domain. In addition, in order to make the proposed method suitable for various kinds of noise conditions, a Gaussian Mixture Model (GMM) based noise classification method is employed. And the corresponding WDA model is used in the enhancement process. From the test results under ITU-T G.160, it is shown that, in comparison with the reference method which is the Wiener filtering method with decision-directed approach for SNR estimation, the WDA-based speech enhancement methods could achieve better objective speech quality, no matter whether the noise conditions are included in the training set or not. And the similar amount of noise reduction and SNR improvement can be obtained with smaller distortion on speech level. (C) 2014 Elsevier B.V. All rights reserved.
Keyword:
Reprint Author's Address:
Email:
Source :
SPEECH COMMUNICATION
ISSN: 0167-6393
Year: 2014
Volume: 60
Page: 13-29
3 . 2 0 0
JCR@2022
ESI Discipline: COMPUTER SCIENCE;
ESI HC Threshold:188
JCR Journal Grade:2
CAS Journal Grade:3
Cited Count:
WoS CC Cited Count: 120
SCOPUS Cited Count: 138
ESI Highly Cited Papers on the List: 0 Unfold All
WanFang Cited Count:
Chinese Cited Count:
30 Days PV: 2
Affiliated Colleges: