Multi-channel Speech Enhancement with Multiple-target GANs

Author：

Yuan, Jing | Bao, Changchun (鲍长春)

Indexed by：

CPCI-S

Abstract：

In noisy environments, speech enhancement is an important technique for improving speech quality. This paper proposes a multi-channel speech enhancement algorithm based on multiple-target Generative Adversarial Networks (GANs). First, a multiple-target GAN (MT-GAN) exploits the spatial characteristics of the microphone array to generate a mask for the target speech signal. Second, a mask is also estimated with a complex Gaussian mixture model (CGMM) and combined iteratively with the network-predicted mask, which makes the speech enhancement system more robust. Finally, the estimated mask is used to construct a beamformer, and the beamformer enhances the noisy speech. Experimental results show that the proposed method improves speech quality and intelligibility compared with the reference methods.
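
The final step of the abstract, building a beamformer from an estimated time-frequency mask, can be illustrated with a minimal NumPy sketch. Several things here are assumptions, not details from the record: the abstract does not name the beamformer type, so an MVDR beamformer in its reference-channel form is used as one common choice; the function names (`spatial_covariance`, `mvdr_weights`, `apply_beamformer`), the array shapes, and the diagonal-loading constant are hypothetical; and the random mask merely stands in for the output of the MT-GAN and CGMM stages.

```python
import numpy as np


def spatial_covariance(stft, mask):
    """Mask-weighted spatial covariance matrix for each frequency bin.

    stft: complex array, shape (channels, freqs, frames)
    mask: real array in [0, 1], shape (freqs, frames)
    returns: complex array, shape (freqs, channels, channels)
    """
    weighted = stft * mask[None, :, :]
    # Sum the mask-weighted outer products x x^H over all frames.
    cov = np.einsum("cft,dft->fcd", weighted, stft.conj())
    norm = np.maximum(mask.sum(axis=-1), 1e-10)
    return cov / norm[:, None, None]


def mvdr_weights(cov_speech, cov_noise, ref_channel=0, diag_load=1e-6):
    """MVDR weights, shape (freqs, channels), in the reference-channel form."""
    n_ch = cov_noise.shape[-1]
    # Diagonal loading keeps the noise covariance well conditioned.
    loaded = cov_noise + diag_load * np.eye(n_ch)[None]
    numerator = np.linalg.solve(loaded, cov_speech)  # Phi_n^{-1} Phi_s
    trace = np.trace(numerator, axis1=-2, axis2=-1)
    return numerator[..., ref_channel] / (trace[:, None] + 1e-10)


def apply_beamformer(weights, stft):
    """Compute y[f, t] = w[f]^H x[:, f, t]."""
    return np.einsum("fc,cft->ft", weights.conj(), stft)


# Toy data: random values stand in for a real multi-channel STFT and mask.
rng = np.random.default_rng(0)
n_ch, n_freq, n_frames = 4, 257, 100
noisy = (rng.standard_normal((n_ch, n_freq, n_frames))
         + 1j * rng.standard_normal((n_ch, n_freq, n_frames)))
speech_mask = rng.uniform(size=(n_freq, n_frames))  # placeholder mask
noise_mask = 1.0 - speech_mask

cov_s = spatial_covariance(noisy, speech_mask)
cov_n = spatial_covariance(noisy, noise_mask)
weights = mvdr_weights(cov_s, cov_n)
enhanced = apply_beamformer(weights, noisy)
print(enhanced.shape)
```

In this sketch the mask only affects the covariance estimates, so any mask estimator that outputs values in [0, 1] per time-frequency bin could be substituted for the placeholder.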

Keywords：

beamforming; speech enhancement; deep learning; generative adversarial networks

Author Community：

[ 1 ] [Yuan, Jing]Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
[ 2 ] [Bao, Changchun]Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

Reprint Author's Address：

[Yuan, Jing]Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

Email：

yuanjings@emails.bjut.edu.cn | baochch@bjut.edu.cn

Related Articles：

Multi-channel Speech Enhancement Based on the MVDR Beamformer and Postfilter
2020, 10th IEEE International Conference on Signal Processing, Communications and Computing (IEEE ICSPCC)
Beamforming-based Speech Enhancement based on Optimal Ratio Mask
2019, IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC)
Joint Ideal Ratio Mask and Generative Adversarial Networks for Monaural Speech Enhancement
2018, 14th IEEE International Conference on Signal Processing (ICSP)

Source ：

2020 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (IEEE ICSPCC 2020)

Year： 2020

Language： English

Cited Count：

WoS CC Cited Count： 0

ESI Highly Cited Papers on the List： 0

Affiliated Colleges：

Faculty of Information Technology