• DocumentCode
    2311151
  • Title

    Perceptual Distortion Analysis and Quality Estimation of Prosody-Modified Speech for TD-PSOLA

  • Author

    Chen, Shi-Han ; Chen, Shun-Ju ; Kuo, Chih-Chung

  • Author_Institution
    Adv. Technol. Center, ITRI, Hsinchu
  • Volume
    1
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    TD-PSOLA is one of the most widely used prosodic modification techniques. However, it occasionally introduces perceptible distortions, and how TD-PSOLA affects speech quality is not yet fully understood or controlled. In this paper, we present a method for estimating quality before the modification is performed. By exploiting the relationship between prosodic modifications and subjective scores, 27 distance measures are proposed, and their respective performances are given and compared. An extensive search over every possible combination of these measures finds that the best correlation between the predicted and subjective scores is 87.6%, obtained by linear regression on 4 of the proposed distance measures. The proposed method does not require synthesizing the target and can be used both in online unit selection and in off-line corpus design of TTS systems.
  • Keywords
    regression analysis; speech synthesis; TD-PSOLA; linear regression; perceptual distortion analysis; prosodic modification techniques; prosody-modified speech; quality estimation; Area measurement; Distortion measurement; Linear regression; Materials testing; Performance evaluation; Predictive models; Speech analysis; Speech synthesis; Synthesizers; System testing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1660157
  • Filename
    1660157
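
The procedure summarized in the abstract (search combinations of distance measures, fit a linear regression on each, and keep the combination whose predictions correlate best with the subjective scores) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the data are synthetic, there are 6 stand-in measures rather than the paper's 27, and the function names `fit_linear`, `predict`, `pearson`, and `best_subset` are hypothetical, not taken from the paper. For scale, choosing 4 measures out of 27 gives 17,550 combinations, so an exhaustive search of that size is cheap.

```python
import itertools
import math
import random


def fit_linear(X, y):
    """Least-squares fit of y ~ b0 + sum(b_j * x_j) via the normal equations."""
    rows = [[1.0] + list(r) for r in X]          # prepend intercept column
    k = len(rows[0])
    # Augmented matrix [A^T A | A^T y]
    M = [[sum(r[i] * r[j] for r in rows) for j in range(k)]
         + [sum(r[i] * t for r, t in zip(rows, y))] for i in range(k)]
    # Gauss-Jordan elimination with partial pivoting
    for c in range(k):
        p = max(range(c, k), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(k):
            if r != c:
                f = M[r][c] / M[c][c]
                M[r] = [a - f * b for a, b in zip(M[r], M[c])]
    return [M[i][k] / M[i][i] for i in range(k)]


def predict(coef, X):
    """Apply fitted coefficients (intercept first) to each feature row."""
    return [coef[0] + sum(c * v for c, v in zip(coef[1:], r)) for r in X]


def pearson(a, b):
    """Pearson correlation coefficient between two equal-length sequences."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    return cov / math.sqrt(va * vb)


def best_subset(features, scores, size):
    """Try every `size`-element subset of measures; return (best r, subset)."""
    best_r, best_combo = -1.0, None
    for combo in itertools.combinations(range(len(features[0])), size):
        X = [[row[j] for j in combo] for row in features]
        r = pearson(predict(fit_linear(X, scores), X), scores)
        if r > best_r:
            best_r, best_combo = r, combo
    return best_r, best_combo


# Synthetic stand-in data (NOT from the paper): 60 stimuli, 6 distance
# measures, and "subjective scores" built from measures 1 and 4 plus noise.
rng = random.Random(0)
features = [[rng.gauss(0.0, 1.0) for _ in range(6)] for _ in range(60)]
scores = [4.0 - 0.8 * f[1] - 0.5 * f[4] + rng.gauss(0.0, 0.1) for f in features]

best_r, best_combo = best_subset(features, scores, size=2)
print(best_combo, round(best_r, 3))
```

On this synthetic data the search recovers the pair of measures the scores were constructed from. Note that the correlation here is computed on the same data used for fitting; whether the paper's 87.6% figure was obtained in-sample or on held-out data is not stated in this record.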