收录:
摘要:
The recent advances of large-scale speech corpus (LSSC) and text-to-speech (TTS) technologies are briefly reviewed, then the architecture and annotation information of a large-scale speech corpus Slib are introduced. Based on Slib, the LSSC-oriented indexing methods is discussed, the set operations and the minimum cover problem related to information retrieval in LSSC are presented. The minimum cover problem is a NP-complete problem, and a greedy algorithm is proposed to obtain an approximation solution. The approximation ratio of the proposed algorithm is analyzed. The application and realization of set operations in TTS are presented, and an approach for choosing proper speech instances of linguistic units based on minimum cover is developed, which can improve the naturalness of the synthesized speech of TTS system.
关键词:
通讯作者信息:
电子邮件地址:
来源 :
Chinese Journal of Computers
ISSN: 0254-4164
年份: 2010
期: 4
卷: 33
页码: 687-696
归属院系: