Problems on large-scale speech corpus and the applications in TTS

Author:

Zhang, Sen | Liu, Lei | Diao, Lu-Hong

Indexed by:

EI PKU CSCD

Abstract:

The recent advances in large-scale speech corpus (LSSC) and text-to-speech (TTS) technologies are briefly reviewed, and the architecture and annotation information of a large-scale speech corpus, Slib, are introduced. Based on Slib, LSSC-oriented indexing methods are discussed, and the set operations and the minimum cover problem related to information retrieval in LSSC are presented. The minimum cover problem is NP-complete, so a greedy algorithm is proposed to obtain an approximate solution, and the approximation ratio of the proposed algorithm is analyzed. The application and realization of set operations in TTS are presented, and an approach for choosing proper speech instances of linguistic units based on minimum cover is developed, which can improve the naturalness of the speech synthesized by a TTS system.
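
The abstract does not spell out the proposed greedy algorithm, so the following is only a minimal sketch of the textbook greedy heuristic for minimum set cover, not the authors' method. It assumes each corpus utterance is indexed by the set of linguistic units it contains; the function name `greedy_min_cover`, the utterance IDs, and the unit labels are all hypothetical. For the textbook heuristic, the number of selected sets is known to be within a factor of H(n) ≤ ln n + 1 of the optimum, where n is the number of units to cover; the ratio analyzed in the paper may differ.

```python
from typing import Dict, Hashable, Iterable, List, Set


def greedy_min_cover(
    universe: Iterable[Hashable],
    candidates: Dict[str, Set[Hashable]],
) -> List[str]:
    """Textbook greedy set cover: repeatedly pick the candidate that
    covers the most still-uncovered elements."""
    uncovered = set(universe)
    chosen: List[str] = []
    while uncovered:
        # Iterate over sorted names so ties are broken deterministically.
        best = max(sorted(candidates), key=lambda name: len(candidates[name] & uncovered))
        gain = candidates[best] & uncovered
        if not gain:
            raise ValueError("universe cannot be covered by the given candidates")
        chosen.append(best)
        uncovered -= gain
    return chosen


# Hypothetical toy index: utterance ID -> linguistic units it contains.
utterances = {
    "utt1": {"ba", "ma", "ni"},
    "utt2": {"ni", "hao"},
    "utt3": {"ma", "hao", "shi"},
    "utt4": {"shi"},
}
target_units = {"ba", "ma", "ni", "hao", "shi"}

cover = greedy_min_cover(target_units, utterances)
print(cover)
```

In a unit-selection setting, a smaller cover means the target units are drawn from fewer source utterances, which is one plausible reading of how minimum cover could reduce concatenation points in the synthesized speech.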

Keyword:

Speech; Computational complexity; Linguistics; Information retrieval; Approximation algorithms

Author Community:

[ 1 ] [Zhang, Sen] Information and Computation Mathematics Lab, Beijing University of Technology, Beijing 100022, China
[ 2 ] [Liu, Lei] Information and Computation Mathematics Lab, Beijing University of Technology, Beijing 100022, China
[ 3 ] [Diao, Lu-Hong] Information and Computation Mathematics Lab, Beijing University of Technology, Beijing 100022, China