Authors:

Cui, Zihao | Bao, Changchun (scholar: 鲍长春) | Nielsen, Jesper Kjar | Grasboll Christensen, Mads

Indexed in:

EI

Abstract:

In this paper, a method for estimating the autoregressive parameters from a signal segment is proposed. The method is based on a deep neural network (DNN) in combination with the classical Levinson-Durbin recursion (LDR). The DNN acts as a pre-processor for the LDR and can be trained on different metrics commonly encountered in speech processing using a generalized analysis-by-synthesis (GABS) structure where the LDR acts as the encoder. Unlike end-to-end data-driven approaches, this structure ensures that the DNN is easy to train and initialize since the DNN only has to learn a simple mapping. The results confirm this and show that the proposed method produces an AR-spectrum that efficiently represents the speech spectrum in terms of the Itakura-Saito divergence, Kullback-Leibler divergence, log-spectral distortion, and speech distortion. © 2020 IEEE.
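
For readers unfamiliar with the classical Levinson-Durbin recursion (LDR) named in the abstract, the following is a minimal sketch in plain Python. It is not the authors' DNN-based system and does not reproduce the GABS training structure; it only shows how AR coefficients can be obtained from autocorrelation values. The function names `autocorrelation` and `levinson_durbin`, the biased autocorrelation estimate, and the sign convention A(z) = 1 + a1 z^-1 + ... + ap z^-p are assumptions made for this illustration.

```python
def autocorrelation(x, max_lag):
    """Biased autocorrelation estimate r[0..max_lag] of a signal segment x."""
    n = len(x)
    return [sum(x[t] * x[t - k] for t in range(k, n)) / n
            for k in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the Yule-Walker equations with the Levinson-Durbin recursion.

    r     : autocorrelation values, at least r[0..order]
    order : AR model order p
    Returns (a, err), where a = [1, a1, ..., ap] are the coefficients of the
    prediction-error filter A(z) = 1 + a1 z^-1 + ... + ap z^-p (an assumed
    sign convention) and err is the final prediction-error power.
    """
    if len(r) < order + 1:
        raise ValueError("need at least order + 1 autocorrelation lags")
    if r[0] <= 0:
        raise ValueError("r[0] must be positive")

    a = [1.0] + [0.0] * order
    err = float(r[0])
    for m in range(1, order + 1):
        if err <= 0.0:
            # Signal is perfectly predictable; higher-order terms stay zero.
            break
        # Reflection coefficient for stage m.
        acc = r[m] + sum(a[i] * r[m - i] for i in range(1, m))
        k = -acc / err
        # Update a[1..m-1] from the previous stage, then append a[m] = k.
        prev = a[:]
        for i in range(1, m):
            a[i] = prev[i] + k * prev[m - i]
        a[m] = k
        # Prediction-error power shrinks by (1 - k^2) at each stage.
        err *= 1.0 - k * k
    return a, err


if __name__ == "__main__":
    # Hypothetical short segment, used only to show the call sequence.
    segment = [0.0, 0.8, 1.0, 0.3, -0.5, -0.9, -0.4, 0.4, 0.9, 0.6]
    r = autocorrelation(segment, 2)
    a, err = levinson_durbin(r, 2)
    print("AR coefficients:", a)
    print("prediction-error power:", err)
```

In the structure described by the abstract, the DNN acts as a pre-processor for the LDR, so a routine of this kind would sit after the network rather than replace it.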

Author affiliations:

  • [1] [Cui, Zihao] Beijing University of Technology, Faculty of Information Technology, Beijing, China
  • [2] [Bao, Changchun] Beijing University of Technology, Faculty of Information Technology, Beijing, China
  • [3] [Nielsen, Jesper Kjar] Aalborg University, Audio Analysis Lab, CREATE, Denmark
  • [4] [Grasboll Christensen, Mads] Aalborg University, Audio Analysis Lab, CREATE, Denmark

Source:

ISSN: 1520-6149

Year: 2020

Volume: 2020-May

Pages: 6759-6763

Language: English

Citation counts:

Web of Science Core Collection citations: 0

Scopus citations: 3

ESI Highly Cited Papers listing: 0
