Prediction of speech pauses based on punctuation information and statistical language model - Details

Author：

Qian, Yi-Li (Qian, Yi-Li.) | Xun, En-Dong (Xun, En-Dong.)

Indexed by：

EI Scopus PKU CSCD

Abstract：

Speech　pauses　are　considered　as　punctuation　marks　of　spoken　language.　People　always　insert　different　pauses　at　the　boundaries　of　rhythmic　phrases　when　communicating　by　language.　Based　on　this　characteristic,　the　speech　pause　of　punctuation　marks　is　investigated　and　the　concept　of　predicting　speech　pauses　using　punctuation　information　is　proposed.　The　punctuation-based　and　SLM-based　methods　are　introduced　to　obtain　training　corpus　and　predict　speech　pauses.　The　influence　of　training　corpus　size　on　the　performance　of　model　is　discussed.　And　the　performance　of　punctuation-based　corpus　and　manually-labeled　corpus　is　compared.　Experimental　results　show　that　the　Chinese　punctuation　supplies　valuable　information　on　pause,　and　the　method　based　on　punctuation　information　can　predict　the　Chinese　speech　pauses　effectively.

Keyword：

Natural language processing systems Forecasting Speech Computational linguistics

Author Community：

[ 1 ] [Qian, Yi-Li]College of Computer Science, Beijing University of Technology, Beijing 100022, China
[ 2 ] [Qian, Yi-Li]College of Computer and Information Technology, Shanxi University, Taiyuan 030006, China
[ 3 ] [Xun, En-Dong]College of Information Sciences, Beijing Language and Culture University, Beijing 100083, China

Reprint Author's Address：

Email：

qyl@sxu.edu.cn

Show more details

Related Keywords：

Use of multi-strategy to Textual Entailment recognition
2011，Journal of Computational Information Systems
An analysis of key issues in Chinese word segmentation
2013，Journal of Computational Information Systems
Statistical language model adaptation based on N-gram distribution
2008，Journal of Beijing University of Aeronautics and Astronautics
Multi-domain global correlation degree branching entropy method for microblog text word segmentation
2020，2020 ACM Turing Celebration Conference - China, ACM TURC 2020

Source ：

Pattern Recognition and Artificial Intelligence

ISSN： 1003-6059

Year： 2008

Issue： 4

Volume： 21

Page： 541-545

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 0

Affiliated Colleges：

信息学部计算机学院

Get Fulltext

Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to