• DocumentCode
    3731909
  • Title
    TTS evaluation: Double-ended objective quality measures
  • Author
    Sunil Rao;C. Mahima;S. Vishnu;S. Adithya;A. Sricharan;V. Ramasubramanian
  • Author_Institution
    PES Institute of Technology - Bangalore South Campus (PESIT-BSC), 560100, India
  • fYear
    2015
  • fDate
    7/1/2015 12:00:00 AM
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    We address the problem of evaluating TTS speech quality and propose a double-ended objective measure: the average spectral distortion between time-aligned reference and synthesized speech. The reference signal is the text input to the TTS system, spoken by the same speaker who recorded the unit database. We describe how the time-aligned spectral distortion is computed via dynamic time warping, and use it to compare the effectiveness of five automatic segmentation techniques for annotating the unit database in two Indian languages, Tamil and Kannada. We also show that the proposed measure yields more meaningful scores than PESQ, which is popular for codec evaluation but is likely less suited to TTS evaluation because it lacks the rigorous time alignment performed in the proposed measure.
  • Keywords
    "Speech","Distortion measurement","Speech coding","Nonlinear distortion","Time measurement","Weight measurement"
  • Publisher
    IEEE
  • Conference_Titel
    Electronics, Computing and Communication Technologies (CONECCT), 2015 IEEE International Conference on
  • Type
    conf
  • DOI
    10.1109/CONECCT.2015.7383899
  • Filename
    7383899
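
The abstract describes an average spectral distortion computed between time-aligned reference and synthesized speech, with the alignment obtained by dynamic time warping. The following is a minimal sketch of that general idea only, not the authors' implementation. The record does not specify the spectral features, the frame distance, or the path constraints, so the Euclidean frame distance, the three-way step pattern, and the names `frame_distance` and `dtw_average_distortion` are all assumptions made for illustration.

```python
import math


def frame_distance(a, b):
    """Euclidean distance between two feature vectors (assumed distance)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dtw_average_distortion(ref, syn):
    """Average frame distance along the minimum-cost DTW path.

    ref, syn: sequences of equal-dimension feature vectors, one per frame
    (for example log-spectral frames; the feature choice is an assumption).
    """
    n, m = len(ref), len(syn)
    if n == 0 or m == 0:
        raise ValueError("both sequences must contain at least one frame")

    inf = float("inf")
    # cost[i][j]: minimum accumulated distance aligning ref[:i+1] with syn[:j+1]
    cost = [[inf] * m for _ in range(n)]
    # steps[i][j]: number of frame pairs on that minimum-cost path
    steps = [[0] * m for _ in range(n)]

    for i in range(n):
        for j in range(m):
            d = frame_distance(ref[i], syn[j])
            if i == 0 and j == 0:
                cost[i][j], steps[i][j] = d, 1
                continue
            # Allowed predecessors: vertical, horizontal, diagonal (assumed).
            candidates = []
            if i > 0:
                candidates.append((cost[i - 1][j], steps[i - 1][j]))
            if j > 0:
                candidates.append((cost[i][j - 1], steps[i][j - 1]))
            if i > 0 and j > 0:
                candidates.append((cost[i - 1][j - 1], steps[i - 1][j - 1]))
            # Ties in accumulated cost are broken in favour of the shorter path.
            best_cost, best_steps = min(candidates)
            cost[i][j] = best_cost + d
            steps[i][j] = best_steps + 1

    # Normalise by path length so utterances of different duration are comparable.
    return cost[n - 1][m - 1] / steps[n - 1][m - 1]
```

In this sketch a synthesized utterance that differs from the reference only in timing scores zero, because the warping path absorbs the stretch before any distortion is averaged. That is the property the abstract contrasts with a measure lacking rigorous time alignment.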