Speech enhancement via generative adversarial LSTM networks

Author：

Xiang, Yang | Bao, Changchun (鲍长春)

Indexed by：

EI; Scopus

Abstract：

Recently, deep learning techniques have significantly promoted the development of speech enhancement. In this paper, we propose a novel framework to conduct speech enhancement, which is based on the long short-term memory networks (LSTMs) and conditional generative adversarial networks (cGANs). This framework includes a generator (G) and a discriminator (D). G and D are both LSTMs so our method is able to be more suitable for speech enhancement task than previous deep neural network-based methods. In this study, we firstly apply this framework to map the log-power spectral (LPS) of clean speech given the noisy LPS input. In addition, this framework is also used to estimate the ideal Wiener filter by giving the noisy Cepstral input. Experimental results indicate that our strategy can not only improve the quality and intelligibility of noisy speech, but also is competitive to other deep learning-based approaches. © 2018 IEEE.

Keyword：

Acoustic waves; Deep neural networks; Long short-term memory; Speech intelligibility; Brain; Speech enhancement; Deep learning

Author Community：

[ 1 ] [Xiang, Yang]Faculty of Information Technology, Beijing University of Technology, Speech and Audio Signal Processing Lab, Beijing; 100124, China
[ 2 ] [Bao, Changchun]Faculty of Information Technology, Beijing University of Technology, Speech and Audio Signal Processing Lab, Beijing; 100124, China

Related Keywords：

DNN-based speech enhancement using MBE model
2018, 16th International Workshop on Acoustic Signal Enhancement, IWAENC 2018
SPEECH ENHANCEMENT VIA GENERATIVE ADVERSARIAL LSTM NETWORKS
2018, 16th International Workshop on Acoustic Signal Enhancement (IWAENC)
Research on Pattern Recognition Performance of Control Chart Based on Deep Learning
2022, 2022 Global Conference on Robotics, Artificial Intelligence and Information Technology, GCRAIT 2022
Single Channel Speech Enhancement Algorithm based on BLSTM-DNN Bidirectional Optimized Hybrid Model
2020, 3rd Annual International Conference on Cloud Technology and Communication Engineering (CTCE)

Source： 16th International Workshop on Acoustic Signal Enhancement (IWAENC)

Year： 2018

Page： 46-50

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 12

ESI Highly Cited Papers on the List： 0

Affiliated Colleges：

Faculty of Information Technology
