DocumentCode
2177797
Title
Evaluation of objective measures for intelligibility prediction of HMM-based synthetic speech in noise
Author
Valentini-Botinhao, Cassia ; Yamagishi, Junichi ; King, Simon
Author_Institution
Centre for Speech Technol. Res., Univ. of Edinburgh, Edinburgh, UK
fYear
2011
fDate
22-27 May 2011
Firstpage
5112
Lastpage
5115
Abstract
In this paper we evaluate four objective measures of speech with regards to intelligibility prediction of synthesized speech in diverse noisy situations. We evaluated three intelligibility measures, the Dau measure, the glimpse proportion and the Speech Intelligibility Index (SII) and a quality measure, the Perceptual Evaluation of Speech Quality (PESQ). For the generation of synthesized speech we used a state of the art HMM-based speech synthesis system. The noisy conditions comprised four additive noises. The measures were compared with subjective intelligibility scores obtained in listening tests. The results show the Dau and the glimpse measures to be the best predictors of intelligibility, with correlations of around 0.83 to subjective scores. All measures gave less accurate predictions of intelligibility for synthetic speech than have previously been found for natural speech; in particular the SII measure. In additional experiments, we processed the synthesized speech by an ideal binary mask before adding noise. The Glimpse measure gave the most accurate intelligibility predictions in this situation.
Keywords
hidden Markov models; speech intelligibility; speech synthesis; HMM-based speech synthesis system; HMM-based synthetic speech; PESQ; SII ideal binary mask; additive noises; intelligibility prediction; Correlation; Hidden Markov models; Noise; Noise measurement; Root mean square; Speech; Speech synthesis; HMM-based speech synthesis; objective measures for speech intelligibility;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947507
Filename
5947507
Link To Document