Speech Enhancement Integrating the MVDR Beamforming and T-F Masking - Details

Author：

Zhu, Jinru (Zhu, Jinru.) | Bao, Changchun (Bao, Changchun.) (Scholars：鲍长春) | Cheng, Rui (Cheng, Rui.)

Indexed by：

CPCI-S

Abstract：

In　this　paper,　a　multi-channel　speech　enhancement　method　with　the　minimum　variance　distortionless　response　(MVDR)　beamforming　method　based　on　the　time-frequency　(T-F)　masking　is　proposed.　In　this　study,　First,　the　logarithmic　power　spectrum　(LPS)　features　of　multi-channel　signals　are　used　as　input　features　to　estimate　a　T-F　mask　of　the　reference　microphone　by　the　deep　neural　network　(DNN)　model.　Then,　the　estimated　mask　is　utilized　to　calculate　speech　covariance　matrix　that　is　used　to　estimate　a　steering　vector　for　constructing　the　MVDR　beamformer.　The　steering　vector　is　estimated　by　the　generalized　eigen-value　decomposition　(GEVD)　method.　Finally,　the　output　speech　of　the　beamformer　is　processed　by　the　DNN-based　IRM　model.　In　order　to　prove　the　effectiveness　of　the　proposed　method,　the　perceptual　evaluation　of　speech　quality　(PESQ)　and　the　segment　signal-to-noise　ratio　(SSNR)　are　employed.　The　experimental　results　show　that　the　proposed　method　effectively　increased　the　PESQ　and　SSNR.

Keyword：

multi-channel speech enhancement T-F masking DNN MVDR beamforming

Author Community：

[ 1 ] [Zhu, Jinru]Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
[ 2 ] [Bao, Changchun]Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
[ 3 ] [Cheng, Rui]Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

Reprint Author's Address：

[Zhu, Jinru]Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

Email：

zhujinru@emails.bjut.edu.cn |
baochch@bjut.edu.cn |
chengrui@emails.bjut.edu.cn

Show more details

Related Keywords：

Speech enhancement integrating the MVDR beamforming and T-F masking
2019，2019 IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2019
Study of MVDR Beamforming with Spatially Distributed Source: Theoretical Analysis and Efficient Microphone Array Geometry Optimization Method
2023，CIRCUITS SYSTEMS AND SIGNAL PROCESSING
Design of a robust MVDR beamforming method with Low-Latency by reconstructing covariance matrix for speech enhancement
2023，APPLIED ACOUSTICS
Beamforming-based Speech Enhancement based on Optimal Ratio Mask
2019，IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC)

Source ：

CONFERENCE PROCEEDINGS OF 2019 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (IEEE ICSPCC 2019)

Year： 2019

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 2

Affiliated Colleges：

信息学部

Get Fulltext

Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to