• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Ji, Qiang (Ji, Qiang.) | Bao, Changchun (Bao, Changchun.) (学者:鲍长春)

收录:

CPCI-S EI Scopus

摘要:

Multi-channel speech coding is an indispensable technology in the field of multi-input multi-output (MIMO) speech interaction. With the development of microphone array signal processing technology, the multi-channel speech coding has been paid more attention. In this paper, a multi-channel speech coding method is proposed based on linear microphone array, which combines the advantages of source speech codec and spatial information of microphone array. At the encoder, the open source Speex encoder is employed to encode speech signal from reference channel. The Inter-Channel Level Difference (ICLD) and Inter-Channel Time Difference (ICTD) are used as the spatial information of speech source and coded together. Considering the auditory characteristics of the human, the ICLD and ICTD are extracted in each sub-band divided by the Gammatone filter. At the decoder, the decoded speech signal of reference channel and the decoded ICLD and ICTD are used to reconstruct speech signals of all channels. The reconstructed speech based on this approach show a higher perceptual quality than the classical methods according to objective evaluation scores. Moreover, the experimental results confirmed that the proposed method can reduce bit rates while preserving speech quality and spatial information as much as possible.

关键词:

Speex Codec Gammatone filter microphone array spatial perception cues speech coding

作者机构:

  • [ 1 ] [Ji, Qiang]Beijing Univ Technol, Speech & Audio Signal Proc Lab, Fac Informat Technol, Beijing, Peoples R China
  • [ 2 ] [Bao, Changchun]Beijing Univ Technol, Speech & Audio Signal Proc Lab, Fac Informat Technol, Beijing, Peoples R China

通讯作者信息:

  • 鲍长春

    [Bao, Changchun]Beijing Univ Technol, Speech & Audio Signal Proc Lab, Fac Informat Technol, Beijing, Peoples R China

查看成果更多字段

相关关键词:

来源 :

PROCEEDINGS OF 2020 IEEE 15TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2020)

ISSN: 2164-5221

年份: 2020

页码: 136-140

语种: 英文

被引次数:

WoS核心集被引频次: 0

SCOPUS被引频次:

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 0

归属院系:

在线人数/总访问数:129/3903461
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司