Linear Prediction-based Part-defined Auto-encoder Used for Speech Enhancement - Details

Author：

Cui, Zihao (Cui, Zihao.) | Bao, Changchun (Bao, Changchun.) (Scholars：鲍长春)

Indexed by：

EI Scopus

Abstract：

This　paper　proposes　a　linear　prediction-based　part-defined　auto-encoder　(PAE)　network　to　enhance　speech　signal.　The　PAE　is　a　defined　decoder　or　a　defined　encoder　network,　based　on　efficient　learning　algorithm　or　classical　model.　In　this　paper,　the　PAE　utilizes　AR-Wiener　filter　as　decoder　part,　and　the　AR-Wiener　filter　is　modified　as　a　linear　prediction　(LP)　model　by　incorporating　the　modified　factor　from　residual　signal.　The　parameters　of　line　spectral　frequency　(LSF)　of　speech　and　noise　and　the　Wiener　filtering　mask　are　utilized　for　training　targets.　Finally,　the　proposed　the　LP-based　PAE　is　compared　with　the　baseline　method,　namely　the　Wiener　filtering　mask-based　DNN.　The　PESQ　and　STOI　results　of　the　LP-based　PAE　are　better　than　baseline　method　at　lower　signal　noise　ratio　(SNR)　levels.　©　2019　IEEE.

Keyword：

Speech communication Audio signal processing Signal encoding Forecasting Decoding Acoustic noise Speech enhancement Signal to noise ratio Learning systems Learning algorithms

Author Community：

[ 1 ] [Cui, Zihao]Speech and Audio Signal Processing Laboratory, Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
[ 2 ] [Bao, Changchun]Speech and Audio Signal Processing Laboratory, Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

IRM with phase parameterization for speech enhancement
2019，2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2019
Phase unwrapping based speech enhancement
2019，2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019
An ideal wiener filter correction-based cIRM speech enhancement method using deep neural networks with skip connections
2018，14th IEEE International Conference on Signal Processing, ICSP 2018
A high throughput LDPC decoder using a mid-range GPU
2014，2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014

Source ：

ISSN： 1520-6149

Year： 2019

Volume： 2019-May

Page： 6880-6884

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 2

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 2

Affiliated Colleges：

信息学部

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to