Title :
Study on Prediction of Prosodic Phrase Boundaries in Chinese TTS
Author :
Zhao, Ziping ; Zhu, Yaoting
Author_Institution :
Nankai Univ., Tianjin
fDate :
July 30 2007-Aug. 1 2007
Abstract :
Hierarchical prosody structure generation is a key component for a speech synthesis system. One major feature of the prosody of Mandarin Chinese speech flow is prosodic phrase grouping. In this paper three methods are proposed to predict prosodic phrase. The first is a statistic probability model, which efficiently combines the local POS and word length information. Experiments show by choosing appropriate threshold the model can reach a high precision and high recall ratio. Secondly we use the decision tree learning algorithm combined with the pause rules of Chinese empty words to predict prosodic phrase boundary in unrestricted Chinese text. The experiments show that the approach improves overall performance. Another is an SVM-based method. The precision and recall ratio are improved after using SVM classifier.
Keywords :
speech synthesis; statistical analysis; support vector machines; Mandarin Chinese speech flow; decision tree learning algorithm; hierarchical prosody structure generation; prosodic phrase boundaries; prosodic phrase grouping; speech synthesis system; statistic probability model; Artificial intelligence; Distributed computing; Educational institutions; Hidden Markov models; Intelligent networks; Probability; Software engineering; Speech synthesis; Support vector machine classification; Support vector machines;
Conference_Titel :
Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2007. SNPD 2007. Eighth ACIS International Conference on
Conference_Location :
Qingdao
Print_ISBN :
978-0-7695-2909-7
DOI :
10.1109/SNPD.2007.67