Title :
Perceptual and objective detection of discontinuities in concatenative speech synthesis
Author :
Stylianou, Yannis ; Syrdal, Ann K.
Author_Institution :
Shannon Labs., AT&T Labs-Research, Florham Park, NJ, USA
Abstract :
Concatenative speech synthesis systems attempt to minimize audible signal discontinuities between two successive concatenated units. An objective distance measure which is able to predict audible discontinuities is therefore very important, particularly in unit selection synthesis, for which units are selected from among a large inventory at run time. In this paper, we describe a perceptual test to measure the detection rate of concatenation discontinuity by humans, and then we evaluate 13 different objective distance measures based on their ability to predict the human results. Criteria used to classify these distances include the detection rate, the Bhattacharyya measure of separability of two distributions, and receiver operating characteristic (ROC) curves. Results show that the Kullback-Leibler distance on power spectra has the higher detection rate followed by the Euclidean distance on Mel-frequency cepstral coefficients (MFCC)
Keywords :
cepstral analysis; spectral analysis; speech synthesis; Bhattacharyya measure of separability; Euclidean distance; Kullback-Leibler distance; Mel-frequency cepstral coefficients; ROC curves; audible discontinuities; concatenation discontinuity detection rate; concatenative speech synthesis; objective distance measure; perceptual test; power spectra; receiver operating characteristic curves; signal discontinuities; successive concatenated units; unit selection synthesis; Cepstral analysis; Concatenated codes; Euclidean distance; Humans; Mel frequency cepstral coefficient; Particle measurements; Signal synthesis; Speech synthesis; Testing; Time measurement;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.941045