Title :
Prediction of Prosodic Phrase Boundaries in Chinese TTS Based on Conditional Random Fields and Transformation Based Learning
Author :
Zhao, Ziping ; Zhu, Yaoting
Author_Institution :
Coll. of Comput. & Inf. Eng., Tianjin Normal Univ., Tianjin, China
Abstract :
Hierarchical prosody structure generation is a key component of a speech synthesis system. One major feature of the prosody of Mandarin Chinese speech flow is prosodic phrase grouping. In this paper we propose an approach for predicting prosodic phrase boundaries in unrestricted Chinese text that combines a Conditional Random Fields (CRFs) model with a transformation-based learning (TBL) model. First, a CRFs model is trained to predict the prosodic phrase boundaries. A TBL-based error-driven learning approach is then applied to correct the initial prediction. The new model is compared with an HMM for prosodic phrase break prediction. Experiments show that the combined approach improves overall performance, with gains in both precision and recall.
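The two-stage idea in the abstract (an initial boundary prediction, then error-driven correction) can be illustrated with a minimal Python sketch of the TBL stage only. Everything in it is an assumption made for illustration: the abstract does not give the features, rule templates, or label set used in the paper. Here the first-stage (CRFs) output is stood in for by a hand-written label list, the labels are "B" (boundary after the word) and "N" (no boundary), and the single rule template "change label X to Y when the next word's part-of-speech tag is Z" is hypothetical, as are all function names.

```python
from itertools import product


def apply_rule(rule, pos_tags, labels):
    """Apply one rule (from_label, to_label, next_pos) to a label sequence."""
    from_label, to_label, next_pos = rule
    out = list(labels)
    for i in range(len(labels) - 1):
        # Fire only where the current label matches and the next POS tag matches.
        if labels[i] == from_label and pos_tags[i + 1] == next_pos:
            out[i] = to_label
    return out


def count_errors(labels, gold):
    """Number of positions where the predicted label differs from the gold label."""
    return sum(1 for a, b in zip(labels, gold) if a != b)


def learn_tbl_rules(pos_tags, initial, gold, max_rules=5):
    """Greedily pick the rule that most reduces errors, until none helps."""
    label_set = sorted(set(gold) | set(initial))
    pos_set = sorted(set(pos_tags))
    current = list(initial)
    rules = []
    for _ in range(max_rules):
        best_rule, best_errors = None, count_errors(current, gold)
        for from_label, to_label, next_pos in product(label_set, label_set, pos_set):
            if from_label == to_label:
                continue
            candidate = (from_label, to_label, next_pos)
            errors = count_errors(apply_rule(candidate, pos_tags, current), gold)
            if errors < best_errors:
                best_rule, best_errors = candidate, errors
        if best_rule is None:
            break  # no candidate rule reduces the error count any further
        rules.append(best_rule)
        current = apply_rule(best_rule, pos_tags, current)
    return rules, current


# Toy data (hypothetical): POS tags, gold boundaries, and a stand-in for
# the first-stage prediction that the TBL stage is meant to correct.
pos_tags = ["n", "v", "u", "n", "p", "n", "v"]
gold = ["N", "B", "N", "B", "N", "N", "N"]
initial = ["N", "N", "N", "B", "N", "B", "N"]

rules, corrected = learn_tbl_rules(pos_tags, initial, gold)
print(rules)
print(count_errors(initial, gold), "->", count_errors(corrected, gold))
```

In a fuller setup the learned rule list would be applied, in order, to the first-stage output on unseen text; richer templates (word length, position in sentence, neighbouring labels) would replace the single template assumed here.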
Keywords :
hidden Markov models; learning (artificial intelligence); speech synthesis; Mandarin Chinese speech; conditional random fields; error driven learning approach; hidden Markov model; prosodic phrase grouping; speech synthesis system; text-to-speech system; transformation based learning; Educational institutions; Entropy; Fuzzy systems; Graphical models; Hidden Markov models; Knowledge engineering; Predictive models; Probability; Speech synthesis; Statistical analysis; Conditional Random Fields (CRFs); Prosodic Phrase; Text-to-speech system (TTS); Transformation-based error-driven learning (TBL)
Conference_Title :
Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
Print_ISBN :
978-0-7695-3735-1
DOI :
10.1109/FSKD.2009.730