• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Liu, X. (Liu, X..) (学者:刘晓) | Bao, C. (Bao, C..) | Sr. (Sr..)

收录:

Scopus

摘要:

The bandwidth limitation of wideband audio systems degrades the subjective quality and naturalness of audio signals. In this paper, a new method for blind bandwidth extension of wideband audio signals is proposed based on ensemble echo state network with temporal evolution. The high-frequency components in the band of 7 ∼ 14 kHz are artificially restored only from the information in the wideband audio. For each region in the wideband feature space, a specific echo state network with recurrent structure is explored to dynamically model the local mapping relationship between wideband audio features and highfrequency spectral envelope. The transition process among regions is modeled by a hidden Markov model, and a network ensemble technique based on temporal evolution is used to fuse multiple echo state networks such that the high-frequency spectral envelope is estimated. Combining the high-frequency fine spectrum extended by spectral translation, the proposed method can effectively extend the wideband audio to super wideband. In addition, the proposed extension method is applied to the ITU-T G.729.1 wideband audio codec and is further evaluated in comparison with the ITU-T G.729.1 Annex E super-wideband audio codec and the hidden Markov model-based reference bandwidth extension method. Objective quality evaluation results indicate that the proposed method is preferred over the hidden Markov model-based reference bandwidth extension method in terms of log spectral distortion, cosh measure, and differential log spectral distortion. Further, the proposed method improves the auditory quality of the wideband audio and also gains a good performance in the subjective listening tests. ©2016 IEEE.

关键词:

Audio bandwidth extension; Audio coding; Echo state network; Hidden markov model

作者机构:

  • [ 1 ] [Liu, X.]School of Electronic Information and Control Engineering, Speech and Audio Signal Processing Laboratory, Beijing University of Technology, Beijing, 100124, China
  • [ 2 ] [Bao, C.]School of Electronic Information and Control Engineering, Speech and Audio Signal Processing Laboratory, Beijing University of Technology, Beijing, 100124, China

通讯作者信息:

电子邮件地址:

查看成果更多字段

相关关键词:

相关文章:

来源 :

ACM Transactions on Audio Speech and Language Processing

ISSN: 2329-9290

年份: 2016

期: 3

卷: 24

页码: 594-607

5 . 4 0 0

JCR@2022

ESI学科: ENGINEERING;

ESI高被引阀值:102

中科院分区:2

被引次数:

WoS核心集被引频次: 0

SCOPUS被引频次: 7

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 2

在线人数/总访问数:1502/2912173
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司