DocumentCode
2877480
Title
Prediction of Prosodic Word Boundaries in Chinese TTS Based on Maximum Entropy Markov Model and Transformation Based Learning
Author
Ziping Zhao ; Xirong Ma
Author_Institution
Coll. of Comput. & Inf. Eng., Tianjin Normal Univ., Tianjin, China
fYear
2012
fDate
17-18 Nov. 2012
Firstpage
258
Lastpage
261
Abstract
Hierarchical prosody structure generation is a key component for a speech synthesis system. As the basic prosodic unit, the prosodic word plays an important role for the naturalness and the intelligibility for the Chinese TTS system. In this paper we proposed an approach for prediction of Chinese prosodic word boundaries in unrestricted Chinese text, which combines Maximum Entropy Markov Model(MEMM) and TBL model. First MEMM is trained to predict the prosodic word boundaries. After that we apply a TBL based error driven learning approach to amend the initial prediction. A comparison is conducted between the new model and HMM for prosodic word boundaries prediction. Experiments show that the combined approach improves overall performance. The precision and recall ratio are improved.
Keywords
hidden Markov models; learning (artificial intelligence); maximum entropy methods; speech synthesis; Chinese TTS system; HMM; MEMM; TBL based error driven learning approach; maximum entropy Markov model; prosodic word boundaries prediction; text-to-speech system; transformation based learning; Computational modeling; Educational institutions; Entropy; Hidden Markov models; Markov processes; Predictive models; Speech; Maximum Entropy Markov Model (MEMM); Prosodic Word; Text-to-speech system (TTS); Transformation-based error-driven learning(TBL);
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence and Security (CIS), 2012 Eighth International Conference on
Conference_Location
Guangzhou
Print_ISBN
978-1-4673-4725-9
Type
conf
DOI
10.1109/CIS.2012.64
Filename
6405909
Link To Document