• DocumentCode
    2801269
  • Title

    Minimum generation error training with weighted Euclidean distance on LSP for HMM-based speech synthesis

  • Author

    Lei, Ming ; Ling, Zhen-Hua ; Dai, Li-Rong

  • Author_Institution
    iFLYTEK Speech Lab., Univ. of Sci. & Technol. of China, Hefei, China
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    4230
  • Lastpage
    4233
  • Abstract
    This paper presents a minimum generation error (MGE) training method using weighted Euclidean distance measure on line spectral pairs (LSP) for HMM-based speech synthesis. In this paper, weighted Euclidean distance on LSP is introduced as the measurement of generation error to improve the consistency between the model training criterion and the subjective perception on the distortion of synthetic speech. Several common weighting techniques are investigated and compared within the MGE training framework. The experimental results show that the formant bounded weighting (FBW) method achieves the best performance, which improves the naturalness of synthetic speech significantly compared with the Euclidean LSP distance measure. Compared with the MGE training using log spectral distortion (LSD) measure, the FBW criterion can achieve similar performance on naturalness with much less computation complexity of model training.
  • Keywords
    distortion; hidden Markov models; speech synthesis; HMM based speech synthesis; formant bounded weighting method; line spectral pairs; log spectral distortion measure; minimum generation error training; synthetic speech distortion; weighted Euclidean distance; Acceleration; Computational modeling; Distortion measurement; Euclidean distance; Hidden Markov models; Maximum likelihood estimation; Predictive models; Speech synthesis; Synthesizers; Weight measurement; Speech synthesis; hidden Markov model; line spectral pairs; minimum generation error;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495688
  • Filename
    5495688