An improved dictionary learning method for speech enhancement - Details

Author：

Hao, Yue (Hao, Yue.) | Bao, Changchun (Bao, Changchun.) (Scholars：鲍长春)

Indexed by：

EI Scopus

Abstract：

In　this　paper,　an　improved　dictionary　learning　method　for　speech　enhancement　is　proposed.　Given　prior　information　of　the　noise,　the　dictionaries　of　speech　and　noise　are　firstly　trained　by　an　approximate　KSVD　algorithm,　respectively.　Then,　the　estimated　short-time　Fourier　transform　(STFT)　magnitudes　of　speech　and　noise　can　be　sparsely　represented　by　multiplying　the　dictionary　with　sparse　coefficients,　which　are　calculated　by　the　least　angle　regression　(LAR)　algorithm.　A　geometrical　stopping　criterion　with　an　adaptive　threshold　is　utilized　to　adjust　the　conventional　stopping　criterion　in　LAR　algorithm　so　that　it　can　increase　the　adaptability　of　LAR.　Next,　we　propose　a　framework　that　utilizes　the　expectation　maximization　(EM)　method　to　refine　the　energy　of　the　estimated　speech　and　noise　in　order　to　obtain　more　accurate　estimation　of　STFT　magnitudes.　Finally,　a　modified　wiener　filter　is　constructed　to　further　eliminate　residual　noise.　When　the　prior　information　of　noise　is　unknown,　an　online　noise　estimation　method　is　applied　to　replace　the　noise　dictionary.　The　test　results　show　that　the　proposed　method　outperforms　the　reference　speech　enhancement　methods.　©　2015　Asia-Pacific　Signal　and　Information　Processing　Association.

Keyword：

Learning systems Speech enhancement Maximum principle

Author Community：

[ 1 ] [Hao, Yue]Speech and Audio Signal Processing Laboratory, School of Electronic Information and Control Engineering, Beijing University of Technology, Beijing; 100124, China
[ 2 ] [Bao, Changchun]Speech and Audio Signal Processing Laboratory, School of Electronic Information and Control Engineering, Beijing University of Technology, Beijing; 100124, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Joint ideal ratio mask and generative adversarial networks for monaural speech enhancement
2018，14th IEEE International Conference on Signal Processing, ICSP 2018
Speech enhancement based on binaural cues
2017，9th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2017
Linear Prediction-based Part-defined Auto-encoder Used for Speech Enhancement
2019，44th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019
A deep-learning method for radar micro-doppler spectrogram restoration
2020，Sensors (Switzerland)

Source ：

Year： 2015

Page： 144-147

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 3

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 2

Affiliated Colleges：

信息学部

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to