Title :
Comparison of Syllable/Phone HMM Based Mandarin TTS
Author :
Duan, Quansheng ; Kang, Shiyin ; Wu, Zhiyong ; Cai, Lianhong ; Shuang, Zhiwei ; Qin, Yong
Author_Institution :
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
Abstract :
The performance of HMM-based text to speech (TTS) system is affected by the basic modeling units and the size of training data. This paper compares two HMM based Mandarin TTS systems using syllable and phone as basic units respectively with 1000, 3000 and 5000 sentences´ training data. Two female speakers´ corpora are used as training data for evaluation. For both corpora, the system using syllable as basic unit outperforms the system using phone as basic unit with 3000 and 5000 sentences´ training data.
Keywords :
hidden Markov models; natural language processing; speech synthesis; HMM-based Mandarin text to speech system; hidden Markov model; speech synthesis; syllable based tonal language; syllable-phone HMM; Context; Hidden Markov models; IP networks; Speech; Speech synthesis; Training; Training data; HMM; Mandarin; Speech Synthesis; syllable;
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-1-4244-7542-1
DOI :
10.1109/ICPR.2010.1092