Title :
The pause duration prediction for Mandarin text-to-speech system
Author :
Yu, Jian ; Tao, Jianhua
Author_Institution :
Nat. Lab. of Pattern Recognition, Chinese Acad. of Sci., Beijing, China
fDate :
30 Oct.-1 Nov. 2005
Abstract :
In this paper, we enter into detailed analysis on how the pause duration under different prosodic boundaries are affected by various contextual factors in natural speech. To get the correlation between them, the paper calculates the mean pause duration under different prosodic boundaries. The contextual factors investigated in this paper contains both linguistic features, such as boundary types, syllable tones of boundary sides, initial and final types etc, and acoustic features, such as pitch gap across the boundary. The paper makes experiments and discussion which reveals the influence of these factors on pause duration. Based on that, the paper creates a pause duration prediction model for Mandarin speech synthesis system. The model was proved to be able to generate high quality prosody output with the listening test.
Keywords :
linguistics; natural languages; speech processing; speech synthesis; Mandarin text-to-speech system; acoustic feature; linguistic feature; pause duration prediction model; prosodic boundary; speech quality; Automation; Laboratories; Natural languages; Pattern analysis; Pattern recognition; Predictive models; Speech analysis; Speech synthesis; Tagging; Testing;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN :
0-7803-9361-9
DOI :
10.1109/NLPKE.2005.1598735