DocumentCode
3731909
Title
TTS evaluation: Double-ended objective quality measures
Author
Sunil Rao;C. Mahima;S. Vishnu;S. Adithya;A. Sricharan;V. Ramasubramanian
Author_Institution
PES Institute of Technology - Bangalore South Campus (PESIT-BSC), 560100, India
fYear
2015
fDate
7/1/2015 12:00:00 AM
Firstpage
1
Lastpage
6
Abstract
We address the problem of TTS speech quality evaluation and propose a double-ended objective measure in the form of average spectral distortion between time-aligned reference and synthesized speech, where the reference signal is made available as the speech of the text input to the TTS spoken by the same speaker as the unit-database. We detail the time-aligned spectral distortion measure calculated via dynamic time-warping and apply this measure for comparison of the effectiveness of 5 different automatic segmentation techniques for annotating the unit-database for two Indian languages, Tamil and Kannada. We also show that the proposed measure yields more meaningful scores than the PESQ measure which is popular for codec evaluation, but likely to be less suited for TTS evaluation due to the lack of a rigorous time-alignment as is done in the proposed spectral distortion measure here.
Keywords
"Speech","Distortion measurement","Speech coding","Nonlinear distortion","Time measurement","Weight measurement"
Publisher
ieee
Conference_Titel
Electronics, Computing and Communication Technologies (CONECCT), 2015 IEEE International Conference on
Type
conf
DOI
10.1109/CONECCT.2015.7383899
Filename
7383899
Link To Document