Indexed by:
Abstract:
The recent advances of large-scale speech corpus (LSSC) and text-to-speech (TTS) technologies are briefly reviewed, then the architecture and annotation information of a large-scale speech corpus Slib are introduced. Based on Slib, the LSSC-oriented indexing methods is discussed, the set operations and the minimum cover problem related to information retrieval in LSSC are presented. The minimum cover problem is a NP-complete problem, and a greedy algorithm is proposed to obtain an approximation solution. The approximation ratio of the proposed algorithm is analyzed. The application and realization of set operations in TTS are presented, and an approach for choosing proper speech instances of linguistic units based on minimum cover is developed, which can improve the naturalness of the synthesized speech of TTS system.
Keyword:
Reprint Author's Address:
Email:
Source :
Chinese Journal of Computers
ISSN: 0254-4164
Year: 2010
Issue: 4
Volume: 33
Page: 687-696
Cited Count:
SCOPUS Cited Count: 5
ESI Highly Cited Papers on the List: 0 Unfold All
WanFang Cited Count:
Chinese Cited Count:
30 Days PV: 0
Affiliated Colleges: