Audio bandwidth extension based on ensemble echo state networks with temporal evolution - Details

Author：

Liu, X. (Liu, X..) (Scholars：刘晓) | Bao, C. (Bao, C..) | Sr. (Sr..)

Indexed by：

Scopus

Abstract：

The　bandwidth　limitation　of　wideband　audio　systems　degrades　the　subjective　quality　and　naturalness　of　audio　signals.　In　this　paper,　a　new　method　for　blind　bandwidth　extension　of　wideband　audio　signals　is　proposed　based　on　ensemble　echo　state　network　with　temporal　evolution.　The　high-frequency　components　in　the　band　of　7　∼　14　kHz　are　artificially　restored　only　from　the　information　in　the　wideband　audio.　For　each　region　in　the　wideband　feature　space,　a　specific　echo　state　network　with　recurrent　structure　is　explored　to　dynamically　model　the　local　mapping　relationship　between　wideband　audio　features　and　highfrequency　spectral　envelope.　The　transition　process　among　regions　is　modeled　by　a　hidden　Markov　model,　and　a　network　ensemble　technique　based　on　temporal　evolution　is　used　to　fuse　multiple　echo　state　networks　such　that　the　high-frequency　spectral　envelope　is　estimated.　Combining　the　high-frequency　fine　spectrum　extended　by　spectral　translation,　the　proposed　method　can　effectively　extend　the　wideband　audio　to　super　wideband.　In　addition,　the　proposed　extension　method　is　applied　to　the　ITU-T　G.729.1　wideband　audio　codec　and　is　further　evaluated　in　comparison　with　the　ITU-T　G.729.1　Annex　E　super-wideband　audio　codec　and　the　hidden　Markov　model-based　reference　bandwidth　extension　method.　Objective　quality　evaluation　results　indicate　that　the　proposed　method　is　preferred　over　the　hidden　Markov　model-based　reference　bandwidth　extension　method　in　terms　of　log　spectral　distortion,　cosh　measure,　and　differential　log　spectral　distortion.　Further,　the　proposed　method　improves　the　auditory　quality　of　the　wideband　audio　and　also　gains　a　good　performance　in　the　subjective　listening　tests.　©2016　IEEE.

Keyword：

Audio bandwidth extension; Audio coding; Echo state network; Hidden markov model

Author Community：

[ 1 ] [Liu, X.]School of Electronic Information and Control Engineering, Speech and Audio Signal Processing Laboratory, Beijing University of Technology, Beijing, 100124, China
[ 2 ] [Bao, C.]School of Electronic Information and Control Engineering, Speech and Audio Signal Processing Laboratory, Beijing University of Technology, Beijing, 100124, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Source ：

ACM Transactions on Audio Speech and Language Processing

ISSN： 2329-9290

Year： 2016

Issue： 3

Volume： 24

Page： 594-607

5 . 4 0 0

JCR@2022

ESI Discipline： ENGINEERING;

ESI HC Threshold：166

CAS Journal Grade：2

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 7

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 3

Affiliated Colleges：

信息学部

材料与制造学部本学院/部未明确归属的数据

Get Fulltext

DOI Library Discovery Baidu Scholar Search SCOPUS

Type
Departments

All Years Choose Year From to