DocumentCode :
2387368
Title :
A Maximum Entropy Markov Model for Prediction of Prosodic Phrase Boundaries in Chinese TTS
Author :
Zhao, Ziping ; Zhao, Tingjian ; Zhu, Yaoting
Author_Institution :
Nankai Univ., Tianjin
fYear :
2007
fDate :
2-4 Nov. 2007
Firstpage :
498
Lastpage :
498
Abstract :
Hierarchical prosody structure generation is a key component for a speech synthesis system. One major feature of the prosody of Mandarin Chinese speech flow is prosodic phrase grouping. In this paper a method based on maximum entropy Markov model (MEMM) is proposed to predict prosodic phrase boundaries in unrestricted Chinese text. MEMM is described in detail that combines transition probabilities and conditional probabilities of states effectively. The conditional probabilities of states are estimated by maximum entropy (ME) theory. A comparison is conducted between the new model and maximum entropy model for prosody phrase break prediction. The experiments show that utilizing the same feature set, MEMM improves overall performance. The precision and recall ratio are improved.
Keywords :
Markov processes; maximum entropy methods; natural language processing; probability; speech synthesis; Mandarin Chinese speech flow; conditional probability; hierarchical prosody structure generation; maximum entropy Markov model; prosodic phrase boundary prediction; text-to-speech synthesis system; transition probability; Educational institutions; Entropy; Geographic Information Systems; Hidden Markov models; Predictive models; Probability; Speech synthesis; State estimation; Statistical analysis; Viterbi algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Granular Computing, 2007. GRC 2007. IEEE International Conference on
Conference_Location :
Fremont, CA
Print_ISBN :
978-0-7695-3032-1
Type :
conf
DOI :
10.1109/GrC.2007.66
Filename :
4403149
Link To Document :
بازگشت