Compressed domain speech enhancement method based on ITU-T G.722.2 - Details

Author：

Xia, Bingyin (Xia, Bingyin.) | Bao, Changchun (Bao, Changchun.) (Scholars：鲍长春)

Indexed by：

EI Scopus SCIE

Abstract：

Based　on　the　bit-stream　of　ITU-T　G.722.2　speech　coding　standard,　through　the　modification　of　codebook　gains　in　the　codec,　a　compressed　domain　speech　enhancement　method　that　is　compatible　with　the　discontinuous　transmission　(DTX)　mode　and　frame　erasure　condition　is　proposed　in　this　paper.　In　non-DTX　mode,　the　Voice　Activity　Detection　(VAD)　is　carried　out　in　the　compressed　domain,　and　the　background　noise　is　classified　into　full-band　distributed　noise　and　low-frequency　distributed　noise.　Then,　the　noise　intensity　is　estimated　based　on　the　algebraic　codebook　power,　and　the　a　priori　SNR　is　estimated　according　to　the　noise　type.　Next,　the　codebook　gains　are　jointly　modified　under　the　rule　of　energy　compensation.　Especially,　the　adaptive　comb　filter　is　adopted　to　remove　the　residual　noise　in　the　excitation　signal　in　low-frequency　distributed　noise.　Finally,　the　modified　codebook　gains　are　re-quantized　in　speech　or　excitation　domain.　For　non-speech　frames　in　DTX　mode,　the　logarithmic　frame　energy　is　attenuated　to　remove　the　noise,　while　the　spectral　envelope　is　kept　unchanged.　When　frame　erasure　occurs,　the　recovered　algebraic　codebook　gain　is　exponentially　attenuated,　and　based　on　the　reconstructed　algebraic　codebook　vector,　all　the　codec　parameters　are　re-quantized　to　form　the　error　concealed　bit-stream.　The　result　of　performance　evaluation　under　ITU-T　G.160　shows　that,　with　much　lower　computational　complexity,　better　noise　reduction,　SNR　improvement,　and　objective　speech　quality　performances　are　achieved　by　the　proposed　method　comparing　with　the　state-of-art　compressed　domain　methods.　The　subjective　speech　quality　test　shows　that,　the　speech　quality　of　the　proposed　method　is　better　than　the　method　that　only　modifies　the　algebraic　codebook　gain,　and　similar　to　the　one　with　the　assistance　of　linear　domain　speech　enhancement　method.　(C)　2013　Elsevier　B.V.　All　rights　reserved.

Keyword：

Compressed domain Parameter modification CELP Speech enhancement G.722.2

Author Community：

[ 1 ] [Xia, Bingyin]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
[ 2 ] [Bao, Changchun]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

Reprint Author's Address：

鲍长春
[Bao, Changchun]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

Email：

baochch@bjut.edu.cn

Show more details

Related Keywords：

Visual Attention Model Based Regions of Interest Detection in Compressed Domain
2012，CHINESE JOURNAL OF ELECTRONICS
Compressed domain based pornographic image recognition using multi-cost sensitive decision trees
2013，SIGNAL PROCESSING
Social images tag ranking based on visual words in compressed domain
2015，NEUROCOMPUTING
An approach of bag-of-words based on visual attention model for pornographic images recognition in compressed domain
2013，NEUROCOMPUTING

Source ：

SPEECH COMMUNICATION

ISSN： 0167-6393

Year： 2013

Issue： 5

Volume： 55

Page： 619-640

3 . 2 0 0

JCR@2022

ESI Discipline： COMPUTER SCIENCE;

JCR Journal Grade：2

CAS Journal Grade：3

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 0

Affiliated Colleges：

信息学部

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to