Title :
Prediction of Prosodic Word Boundaries in Chinese TTS Based on Maximum Entropy Markov Model and Transformation Based Learning
Author :
Ziping Zhao ; Xirong Ma
Author_Institution :
Coll. of Comput. & Inf. Eng., Tianjin Normal Univ., Tianjin, China
Abstract :
Hierarchical prosody structure generation is a key component for a speech synthesis system. As the basic prosodic unit, the prosodic word plays an important role for the naturalness and the intelligibility for the Chinese TTS system. In this paper we proposed an approach for prediction of Chinese prosodic word boundaries in unrestricted Chinese text, which combines Maximum Entropy Markov Model(MEMM) and TBL model. First MEMM is trained to predict the prosodic word boundaries. After that we apply a TBL based error driven learning approach to amend the initial prediction. A comparison is conducted between the new model and HMM for prosodic word boundaries prediction. Experiments show that the combined approach improves overall performance. The precision and recall ratio are improved.
Keywords :
hidden Markov models; learning (artificial intelligence); maximum entropy methods; speech synthesis; Chinese TTS system; HMM; MEMM; TBL based error driven learning approach; maximum entropy Markov model; prosodic word boundaries prediction; text-to-speech system; transformation based learning; Computational modeling; Educational institutions; Entropy; Hidden Markov models; Markov processes; Predictive models; Speech; Maximum Entropy Markov Model (MEMM); Prosodic Word; Text-to-speech system (TTS); Transformation-based error-driven learning(TBL);
Conference_Titel :
Computational Intelligence and Security (CIS), 2012 Eighth International Conference on
Conference_Location :
Guangzhou
Print_ISBN :
978-1-4673-4725-9
DOI :
10.1109/CIS.2012.64