Sparse Hidden Markov Models for Speech Enhancement in Non-Stationary Noise Environments

Author：

Deng, Feng | Bao, Changchun (鲍长春) | Kleijn, W. Bastiaan

Indexed by：

EI; Scopus; SCIE

Abstract：

We propose a sparse hidden Markov model (HMM)-based single-channel speech enhancement method that models the speech and noise gains accurately in non-stationary noise environments. Autoregressive models are employed to describe the speech and noise in a unified framework, and the speech and noise gains are modeled as random processes with memory. The likelihood criterion for finding the model parameters is augmented with an l_p regularization term, resulting in a sparse autoregressive HMM (SARHMM) system that encourages sparsity in the speech and noise modeling. In the SARHMM, only a small number of HMM states contribute significantly to the model of each particular observed speech segment. As it eliminates ambiguity between noise and speech spectra, the sparsity of speech and noise modeling helps to improve the tracking of the changes of both spectral shapes and power levels of non-stationary noise. Using the modeled speech and noise SARHMMs, we first construct a noise estimator to estimate the noise power spectrum. Then, a Bayesian speech estimator is derived to obtain the enhanced speech signal. The subjective and objective test results indicate that the proposed speech enhancement scheme can achieve a larger segmental SNR improvement, a lower log-spectral distortion and a better speech quality in stationary noise conditions than state-of-the-art reference methods. The advantage of the new method is largest for non-stationary noise conditions.
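
The abstract does not give the exact form of the regularized criterion, so the following is only a generic sketch of how an l_p penalty with 0 < p < 1 can favor sparse state weights. The function names, the weight vectors, and the values of `lam` and `p` are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def lp_penalty(weights, p=0.5):
    """Sum of |w_i|^p. For 0 < p < 1 and weights that sum to one,
    this is smaller when the mass sits on fewer entries."""
    w = np.abs(np.asarray(weights, dtype=float))
    return float(np.sum(w ** p))

def penalized_score(log_likelihood, weights, lam=1.0, p=0.5):
    """Hypothetical objective: log-likelihood minus a weighted l_p penalty."""
    return log_likelihood - lam * lp_penalty(weights, p)

# Two hypothetical state-weight vectors for one speech segment.
sparse_w = np.array([0.9, 0.1, 0.0, 0.0])      # few states dominate
dense_w = np.array([0.25, 0.25, 0.25, 0.25])   # all states contribute equally

# With the same log-likelihood, the sparse weighting scores higher.
same_loglik = -10.0
sparse_score = penalized_score(same_loglik, sparse_w)
dense_score = penalized_score(same_loglik, dense_w)
print(sparse_score > dense_score)
```

Under these assumptions, two explanations of a segment that fit equally well are separated by the penalty alone, which is the sense in which such a term encourages only a small number of states to contribute.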

Keyword：

non-stationary noise; speech enhancement; sparse autoregressive hidden Markov model (ARHMM); gain modeling

Author Affiliations：

[ 1 ] [Deng, Feng]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
[ 2 ] [Bao, Changchun]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
[ 3 ] [Kleijn, W. Bastiaan]Victoria Univ Wellington, Sch Engn & Comp Sci, Commun & Signal Proc Grp, Wellington 6140, New Zealand

Reprint Author's Address：

[Deng, Feng]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

Email：

dengfeng@emails.bjut.edu.cn |
baochch@bjut.edu.cn |
bastiaan.kleijn@ecs.vuw.ac.nz

Related Articles：

SPARSE HMM-BASED SPEECH ENHANCEMENT METHOD FOR STATIONARY AND NON-STATIONARY NOISE ENVIRONMENTS
2015，40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Speech enhancement algorithm based on wavelet transform
2009，Journal of Data Acquisition and Processing
Multi-channel Speech Enhancement with Multiple-target GANs
2020，10th IEEE International Conference on Signal Processing, Communications and Computing (IEEE ICSPCC)
A fast adaptive Kalman filtering algorithm for speech enhancement
2011，2011 7th IEEE International Conference on Automation Science and Engineering, CASE 2011

Source ：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING

ISSN： 2329-9290

Year： 2015

Issue： 11

Volume： 23

Page： 1973-1987

Impact Factor： 5.400 (JCR@2022)

ESI Discipline： ENGINEERING;

ESI HC Threshold：174

JCR Journal Grade：2

CAS Journal Grade：2

Cited Count：

WoS CC Cited Count： 27

SCOPUS Cited Count： 33

ESI Highly Cited Papers on the List： 0

WanFang Cited Count：

Chinese Cited Count：

Affiliated Colleges：

Faculty of Information Technology
