• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Ma, Yong (Ma, Yong.) | Bao, Chang-Chun (Bao, Chang-Chun.) (学者:鲍长春)

收录:

EI Scopus SCIE

摘要:

Sparse deep neural networks (SDNNs) for speaker segmentation are proposed. First, the SDNNs are trained using the side information that is the class label of the input. Then, speaker-specific features are extracted from the super-vector feature of the speech signal by the SDNNs. Lastly, the label of each speech frame is obtained by K-means clustering, which is used to segment different speakers of a continuous speech stream. The performance evaluation using the multi-speaker speech stream corpus generated from the TIMIT database shows that the proposed speaker segmentation algorithm outperforms the Bayesian information criterion method and the deep auto-encoder networks method.

关键词:

作者机构:

  • [ 1 ] [Ma, Yong]Beijing Univ Technol, Speech & Audio Signal Proc Lab, Sch Elect Informat & Control Engn, Beijing 100124, Peoples R China
  • [ 2 ] [Bao, Chang-Chun]Beijing Univ Technol, Speech & Audio Signal Proc Lab, Sch Elect Informat & Control Engn, Beijing 100124, Peoples R China
  • [ 3 ] [Ma, Yong]Jiangsu Normal Univ, Sch Phys & Elect Engn, Xuzhou, Peoples R China

通讯作者信息:

  • 鲍长春

    [Bao, Chang-Chun]Beijing Univ Technol, Speech & Audio Signal Proc Lab, Sch Elect Informat & Control Engn, Beijing 100124, Peoples R China

电子邮件地址:

查看成果更多字段

相关关键词:

相关文章:

来源 :

ELECTRONICS LETTERS

ISSN: 0013-5194

年份: 2015

期: 8

卷: 51

1 . 1 0 0

JCR@2022

ESI学科: ENGINEERING;

ESI高被引阀值:174

JCR分区:3

中科院分区:4

被引次数:

WoS核心集被引频次: 1

SCOPUS被引频次:

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 0

归属院系:

在线人数/总访问数:221/3911574
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司