Title :
A Maximum Entropy Markov Model for Prediction of Prosodic Phrase Boundaries in Chinese TTS
Author :
Zhao, Ziping ; Zhao, Tingjian ; Zhu, Yaoting
Author_Institution :
Nankai Univ., Tianjin
Abstract :
Hierarchical prosody structure generation is a key component of a speech synthesis system. One major feature of the prosody of Mandarin Chinese speech flow is prosodic phrase grouping. This paper proposes a method based on the maximum entropy Markov model (MEMM) to predict prosodic phrase boundaries in unrestricted Chinese text. The MEMM, which combines state transition probabilities with conditional state probabilities, is described in detail. The conditional state probabilities are estimated using the maximum entropy (ME) principle. The new model is compared with a maximum entropy model for prosodic phrase break prediction. Experiments show that, using the same feature set, the MEMM improves overall performance, with gains in both precision and recall.
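The idea of combining transition probabilities with per-word conditional state probabilities can be illustrated with a small Viterbi decoder. This is only a sketch under stated assumptions, not the authors' implementation: the abstract does not give the exact combination rule, so the code assumes the score of a state is the product of a transition probability P(s | previous state) and an ME-estimated probability P(s | features of the current word). The state labels "B" (boundary) and "N" (no boundary), the function name viterbi_decode, and all parameter names are hypothetical.

```python
import math


def viterbi_decode(me_probs, trans, init, states=("B", "N")):
    """Return the most likely state sequence under the assumed model.

    me_probs[t][s]: assumed ME estimate of P(s | features of word t)
    trans[p][s]:    assumed transition probability P(s | previous state p)
    init[s]:        assumed probability of state s at the first word
    All probabilities are assumed strictly positive (log is taken).
    """
    # Scores are kept in log space to avoid underflow on long sentences.
    delta = [{s: math.log(init[s]) + math.log(me_probs[0][s]) for s in states}]
    back = []  # back[t-1][s] = best previous state for s at position t

    for t in range(1, len(me_probs)):
        row, ptr = {}, {}
        for s in states:
            # Pick the predecessor that maximizes the path score into s.
            best_prev = max(
                states, key=lambda p: delta[-1][p] + math.log(trans[p][s])
            )
            row[s] = (
                delta[-1][best_prev]
                + math.log(trans[best_prev][s])
                + math.log(me_probs[t][s])
            )
            ptr[s] = best_prev
        delta.append(row)
        back.append(ptr)

    # Trace back from the best final state.
    last = max(states, key=lambda s: delta[-1][s])
    path = [last]
    for ptr in reversed(back):
        path.append(ptr[path[-1]])
    return path[::-1]
```

Because the transition term penalizes unlikely label sequences (for example, two boundaries in a row), the decoded sequence can differ from the one obtained by taking the most probable ME label at each word independently, which is the kind of effect the comparison with a plain maximum entropy model would measure.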
Keywords :
Markov processes; maximum entropy methods; natural language processing; probability; speech synthesis; Mandarin Chinese speech flow; conditional probability; hierarchical prosody structure generation; maximum entropy Markov model; prosodic phrase boundary prediction; text-to-speech synthesis system; transition probability; Educational institutions; Entropy; Geographic Information Systems; Hidden Markov models; Predictive models; Probability; Speech synthesis; State estimation; Statistical analysis; Viterbi algorithm
Conference_Title :
Granular Computing, 2007. GRC 2007. IEEE International Conference on
Conference_Location :
Fremont, CA
Print_ISBN :
978-0-7695-3032-1
DOI :
10.1109/GrC.2007.66