Speech enhancement based on auditory cepstral coefficient with deep learning

Author：

Li, Ruwei | Sun, Xiaoyue | Liu, Yanan | Li, Tao

Indexed by：

EI PKU CSCD

Abstract：

Existing speech enhancement algorithms perform poorly in non-stationary noise at low signal-to-noise ratio (SNR). To address this problem, a novel speech enhancement algorithm is presented. First, a fully connected deep neural network (DNN) is constructed, and a multi-resolution auditory cepstral coefficient (MRACC), extracted from four cochleagrams of different resolutions, is used as the network input; this feature captures both local information and spectrotemporal context. Second, an adaptive mask (AM) is proposed that adjusts the weights of the ideal binary mask (IBM) and the ideal ratio mask (IRM) according to changes in the noise. Finally, the estimated AM is used to obtain the enhanced speech. Experimental results show that the proposed algorithm not only further improves speech quality and intelligibility but also suppresses more noise than the comparison algorithms. © 2019, Editorial Board of Journal of Huazhong University of Science and Technology. All rights reserved.
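
The abstract states that the AM adjusts the weights of the IBM and the IRM according to the noise, but it does not give the weighting rule. The sketch below is therefore only an illustration under stated assumptions: it uses the standard textbook definitions of the IBM and IRM on time-frequency power values and combines them with a hypothetical weight `alpha` in [0, 1]. The function names, the parameters `lc_db` and `beta`, and the convex-combination form are assumptions for illustration, not the authors' formulation.

```python
import numpy as np

EPS = 1e-12  # guards against division by zero and log of zero


def ideal_binary_mask(speech_pow, noise_pow, lc_db=0.0):
    """Standard IBM: 1 where the local SNR (dB) exceeds lc_db, else 0."""
    snr_db = 10.0 * np.log10((speech_pow + EPS) / (noise_pow + EPS))
    return (snr_db > lc_db).astype(float)


def ideal_ratio_mask(speech_pow, noise_pow, beta=0.5):
    """Standard IRM: (speech / (speech + noise)) ** beta, values in [0, 1]."""
    return (speech_pow / (speech_pow + noise_pow + EPS)) ** beta


def adaptive_mask(speech_pow, noise_pow, alpha):
    """Hypothetical AM: convex combination of IBM and IRM.

    alpha may be a scalar or an array broadcastable to the mask shape.
    How alpha should track the noise is NOT specified in the abstract;
    the caller must supply it.
    """
    alpha = np.clip(alpha, 0.0, 1.0)
    ibm = ideal_binary_mask(speech_pow, noise_pow)
    irm = ideal_ratio_mask(speech_pow, noise_pow)
    return alpha * ibm + (1.0 - alpha) * irm


if __name__ == "__main__":
    # Toy 2x2 time-frequency power values (made up for illustration).
    speech = np.array([[4.0, 1.0], [0.0, 9.0]])
    noise = np.array([[1.0, 4.0], [1.0, 0.0]])
    print(adaptive_mask(speech, noise, alpha=0.5))
```

With `alpha = 1` the sketch reduces to the IBM and with `alpha = 0` to the IRM, so any noise-dependent rule for `alpha` interpolates between a hard and a soft mask. In the described system the mask would be estimated by the DNN from MRACC features rather than computed from the clean speech and noise as here.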

Keyword：

Speech enhancement; Signal to noise ratio; Deep learning; Deep neural networks; Neural networks; Speech intelligibility; Amplitude modulation

Author Affiliations：

[1] Li, Ruwei: College of Information and Communications Engineering, Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
[2] Sun, Xiaoyue: College of Information and Communications Engineering, Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
[3] Liu, Yanan: College of Information and Communications Engineering, Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
[4] Li, Tao: College of Information and Communications Engineering, Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China

Related Articles：

An ideal Wiener filter correction-based cIRM speech enhancement method using deep neural networks with skip connections
2018, 14th IEEE International Conference on Signal Processing, ICSP 2018
Phase unwrapping based speech enhancement
2019, 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019
Speech Enhancement with Phase Correction based on Modified DNN Architecture
2018, 10th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2018
Speech enhancement via generative adversarial LSTM networks
2018, 16th International Workshop on Acoustic Signal Enhancement, IWAENC 2018
Speech enhancement based on cepstral mapping and deep neural networks
2018, 4th IEEE International Conference on Computer and Communications, ICCC 2018

Source ：

Journal of Huazhong University of Science and Technology (Natural Science Edition)

ISSN： 1671-4512

Year： 2019

Issue： 9

Volume： 47

Page： 78-83

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 4

ESI Highly Cited Papers on the List： 0

Affiliated Colleges：

Faculty of Information Technology
