SPEECH ENHANCEMENT VIA GENERATIVE ADVERSARIAL LSTM NETWORKS - Details

Author：

Xiang, Yang (Xiang, Yang.) | Bao, Changchun (Bao, Changchun.) (Scholars：鲍长春)

Indexed by：

CPCI-S

Abstract：

Recently,　deep　learning　techniques　have　significantly　promoted　the　development　of　speech　enhancement.　In　this　paper,　we　propose　a　novel　framework　to　conduct　speech　enhancement,　which　is　based　on　the　long　short-term　memory　networks　(LSTMs)　and　conditional　generative　adversarial　networks　(cGANs).　This　framework　includes　a　generator　(G)　and　a　discriminator　(D).　G　and　D　are　both　LSTMs　so　our　method　is　able　to　be　more　suitable　for　speech　enhancement　task　than　previous　deep　neural　network-based　methods.　In　this　study,　we　firstly　apply　this　framework　to　map　the　log-power　spectral　(LPS)　of　clean　speech　given　the　noisy　LPS　input.　In　addition,　this　framework　is　also　used　to　estimate　the　ideal　Wiener　filter　by　giving　the　noisy　Cepstral　input.　Experimental　results　indicate　that　our　strategy　can　not　only　improve　the　quality　and　intelligibility　of　noisy　speech,　but　also　is　competitive　to　other　deep　learning-based　approaches.

Keyword：

long short-term memory networks speech enhancement deep learning generative adversarial networks

Author Community：

[ 1 ] [Xiang, Yang]Beijing Univ Technol, Speech & Audio Signal Proc Lab, Fac Informat Technol, Beijing 100124, Peoples R China
[ 2 ] [Bao, Changchun]Beijing Univ Technol, Speech & Audio Signal Proc Lab, Fac Informat Technol, Beijing 100124, Peoples R China

Reprint Author's Address：

[Xiang, Yang]Beijing Univ Technol, Speech & Audio Signal Proc Lab, Fac Informat Technol, Beijing 100124, Peoples R China

Email：

xiangyang3131777@emails.bjut.edu.cn |
baochch@bjut.edu.cn

Show more details

Related Keywords：

Transfer learning-based multiple digital twin-assisted intelligent mechanical fault diagnosis
2024，MEASUREMENT SCIENCE AND TECHNOLOGY
Multi-channel Speech Enhancement with Multiple-target GANs
2020，10th IEEE International Conference on Signal Processing, Communications and Computing (IEEE ICSPCC)
Joint Ideal Ratio Mask and Generative Adversarial Networks for Monaural Speech Enhancement
2018，14th IEEE International Conference on Signal Processing (ICSP)
Joint ideal ratio mask and generative adversarial networks for monaural speech enhancement
2018，14th IEEE International Conference on Signal Processing, ICSP 2018

Source ：

2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC)

ISSN： 2639-4316

Year： 2018

Page： 46-50

Language： English

Cited Count：

WoS CC Cited Count： 7

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 2

Affiliated Colleges：

信息学部

Get Fulltext

Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to