• DocumentCode
    394294
  • Title
    A Mandarin intonation prediction model that can output real pitch patterns
  • Author
    Pan, Neng-Huang ; Yu, Ming-shing ; Wu, Ming-Jer
  • Author_Institution
    Dept. of Appl. Math., Nat. Chung-Hsing Univ., Taichung, Taiwan
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    In this paper we propose an intonation prediction model for Mandarin text-to-speech (TTS) systems. The model outputs real pitch patterns by selecting a suitable pattern from the training corpus, which is a new approach. It has three advantages. (1) It improves the naturalness of the synthesized speech, scoring higher in subjective listening tests. (2) It is accurate: average errors of 0.425 ms and 0.457 ms were obtained for the inside and outside tests, respectively, and pattern errors of 0.128 ms and 0.129 ms were obtained for the inside and outside tests, respectively. We also found that the pattern error measure is consistent with human hearing. (3) It does not require a very large training corpus, which relieves the data sparsity problem.
  • Keywords
    pattern recognition; speech processing; speech synthesis; Mandarin intonation prediction model; TTS systems; accuracies; data sparsity problem; human hearing; pattern error measurement; real pitch patterns; subjective listening tests; synthesized speech naturalness; training corpus; Auditory system; Electronic mail; Humans; Mathematical model; Mathematics; Predictive models; Signal synthesis; Speech synthesis; Testing; Text analysis;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '03), Proceedings
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type
    conf
  • DOI
    10.1109/ICASSP.2003.1198826
  • Filename
    1198826