DocumentCode
394294
Title
A Mandarin intonation prediction model that can output real pitch patterns
Author
Pan, Neng-Huang ; Yu, Ming-shing ; Wu, Ming-Jer
Author_Institution
Dept. of Appl. Math., Nat. Chung-Hsing Univ., Taichung, Taiwan
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
In this paper we propose an intonation prediction model for Mandarin text-to-speech (TTS) systems. The model outputs real pitch patterns by selecting a suitable pattern from the training corpus, which is a new approach. Its advantages are as follows. (1) It improves the naturalness of the synthesized speech and achieves higher scores in subjective listening tests. (2) It is accurate: average errors of 0.425 ms and 0.457 ms were obtained for the inside and outside tests, respectively, and pattern errors of 0.128 ms and 0.129 ms were obtained for the inside and outside tests, respectively. We found that the pattern error measure is consistent with human hearing. (3) The training corpus need not be very large, which relieves the data sparsity problem.
Keywords
pattern recognition; speech processing; speech synthesis; Mandarin intonation prediction model; TTS systems; accuracies; data sparsity problem; human hearing; pattern error measurement; real pitch patterns; subjective listening tests; synthesized speech naturalness; training corpus; Auditory system; Electronic mail; Humans; Mathematical model; Mathematics; Predictive models; Signal synthesis; Speech synthesis; Testing; Text analysis
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198826
Filename
1198826