收录:
摘要:
The bandwidth limitation of wideband audio systems degrades the subjective quality and naturalness of audio signals. In this paper, a new method for blind bandwidth extension of wideband audio signals is proposed based on ensemble echo state network with temporal evolution. The high-frequency components in the band of 7 ∼ 14 kHz are artificially restored only from the information in the wideband audio. For each region in the wideband feature space, a specific echo state network with recurrent structure is explored to dynamically model the local mapping relationship between wideband audio features and highfrequency spectral envelope. The transition process among regions is modeled by a hidden Markov model, and a network ensemble technique based on temporal evolution is used to fuse multiple echo state networks such that the high-frequency spectral envelope is estimated. Combining the high-frequency fine spectrum extended by spectral translation, the proposed method can effectively extend the wideband audio to super wideband. In addition, the proposed extension method is applied to the ITU-T G.729.1 wideband audio codec and is further evaluated in comparison with the ITU-T G.729.1 Annex E super-wideband audio codec and the hidden Markov model-based reference bandwidth extension method. Objective quality evaluation results indicate that the proposed method is preferred over the hidden Markov model-based reference bandwidth extension method in terms of log spectral distortion, cosh measure, and differential log spectral distortion. Further, the proposed method improves the auditory quality of the wideband audio and also gains a good performance in the subjective listening tests. ©2016 IEEE.
关键词:
通讯作者信息:
电子邮件地址:
来源 :
ACM Transactions on Audio Speech and Language Processing
ISSN: 2329-9290
年份: 2016
期: 3
卷: 24
页码: 594-607
5 . 4 0 0
JCR@2022
ESI学科: ENGINEERING;
ESI高被引阀值:166
中科院分区:2