收录:
摘要:
In recent years, deep neural network (DNN) has been widely used for monaural speech enhancement due to its good performance for learning higher-level information. In this paper, an approach of speech enhancement with binaural cues derived from DNN is proposed. A deep-learning-based model is investigated to learn a mapping function between the pre-enhanced cue and clean cue, which are extracted from the pre-enhanced speech and clean speech, respectively. The proposed method contains two stages: offline training stage and online enhancing stage. At offline training stage, a stacked auto-encoder (SAE) model, a type of deep neural network, is used to learn the mapping function. At online stage, the clean cue is estimated by the learned mapping function online first. Then, the noisy speech can be enhanced with the estimated clean cue. Compared to the reference methods, the experimental results yield significant improvements for three objective measurements.
关键词:
通讯作者信息:
来源 :
2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017)
ISSN: 2309-9402
年份: 2017
页码: 145-148
语种: 英文
归属院系: