Wiener filtering based speech enhancement with Weighted Denoising Auto-encoder and noise classification - Details

Author：

Xia, Bingyin (Xia, Bingyin.) | Bao, Changchun (Bao, Changchun.) (Scholars：鲍长春)

Indexed by：

EI Scopus SCIE

Abstract：

A　novel　speech　enhancement　method　based　on　Weighted　Denoising　Auto-encoder　(WDA)　and　noise　classification　is　proposed　in　this　paper.　A　weighted　reconstruction　loss　function　is　introduced　into　the　conventional　Denoising　Auto-encoder　(DA),　and　the　relationship　between　the　power　spectra　of　clean　speech　and　noisy　observation　is　described　by　WDA　model.　First,　the　sub-band　power　spectrum　of　clean　speech　is　estimated　by　WDA　model　from　the　noisy　observation.　Then,　the　a　priori　SNR　is　estimated　by　the　a　Posteriori　SNR　Controlled　Recursive　Averaging　(PCRA)　approach.　Finally,　the　clean　speech　is　obtained　by　Wiener　filter　in　frequency　domain.　In　addition,　in　order　to　make　the　proposed　method　suitable　for　various　kinds　of　noise　conditions,　a　Gaussian　Mixture　Model　(GMM)　based　noise　classification　method　is　employed.　And　the　corresponding　WDA　model　is　used　in　the　enhancement　process.　From　the　test　results　under　ITU-T　G.160,　it　is　shown　that,　in　comparison　with　the　reference　method　which　is　the　Wiener　filtering　method　with　decision-directed　approach　for　SNR　estimation,　the　WDA-based　speech　enhancement　methods　could　achieve　better　objective　speech　quality,　no　matter　whether　the　noise　conditions　are　included　in　the　training　set　or　not.　And　the　similar　amount　of　noise　reduction　and　SNR　improvement　can　be　obtained　with　smaller　distortion　on　speech　level.　(C)　2014　Elsevier　B.V.　All　rights　reserved.

Keyword：

Wiener filter Gaussian mixture model Weighted Denoising Auto-encoder Speech enhancement Noise classification SNR estimation

Author Community：

[ 1 ] [Xia, Bingyin]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
[ 2 ] [Bao, Changchun]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

Reprint Author's Address：

鲍长春
[Bao, Changchun]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

Email：

baochch@bjut.edu.cn

Show more details

Related Keywords：

Speech Enhancement with Weighted Denoising Auto-Encoder
2013，14th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2013)
MULTIPLICATIVE UPDATE OF AR GAINS IN CODEBOOK-DRIVEN SPEECH ENHANCEMENT
2016，41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
A CODEBOOK-DRIVFN SPEECH ENHANCEMENT METHOD BY EXPLOITING SPEECH HARMONICITY
2017，IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC)
DNN-Based Speech Enhancement via Integrating NMF and CASA
2018，International Conference on Audio, Language and Image Processing (ICALIP)

Source ：

SPEECH COMMUNICATION

ISSN： 0167-6393

Year： 2014

Volume： 60

Page： 13-29

3 . 2 0 0

JCR@2022

ESI Discipline： COMPUTER SCIENCE;

ESI HC Threshold：188

JCR Journal Grade：2

CAS Journal Grade：3

Cited Count：

WoS CC Cited Count： 120

SCOPUS Cited Count： 138

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 2

Affiliated Colleges：

信息学部

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to