DocumentCode :
1749767
Title :
Perceptual and objective detection of discontinuities in concatenative speech synthesis
Author :
Stylianou, Yannis ; Syrdal, Ann K.
Author_Institution :
Shannon Labs., AT&T Labs-Research, Florham Park, NJ, USA
Volume :
2
fYear :
2001
fDate :
2001
Firstpage :
837
Abstract :
Concatenative speech synthesis systems attempt to minimize audible signal discontinuities between two successive concatenated units. An objective distance measure which is able to predict audible discontinuities is therefore very important, particularly in unit selection synthesis, for which units are selected from among a large inventory at run time. In this paper, we describe a perceptual test to measure the detection rate of concatenation discontinuity by humans, and then we evaluate 13 different objective distance measures based on their ability to predict the human results. Criteria used to classify these distances include the detection rate, the Bhattacharyya measure of separability of two distributions, and receiver operating characteristic (ROC) curves. Results show that the Kullback-Leibler distance on power spectra has the higher detection rate followed by the Euclidean distance on Mel-frequency cepstral coefficients (MFCC)
Keywords :
cepstral analysis; spectral analysis; speech synthesis; Bhattacharyya measure of separability; Euclidean distance; Kullback-Leibler distance; Mel-frequency cepstral coefficients; ROC curves; audible discontinuities; concatenation discontinuity detection rate; concatenative speech synthesis; objective distance measure; perceptual test; power spectra; receiver operating characteristic curves; signal discontinuities; successive concatenated units; unit selection synthesis; Cepstral analysis; Concatenated codes; Euclidean distance; Humans; Mel frequency cepstral coefficient; Particle measurements; Signal synthesis; Speech synthesis; Testing; Time measurement;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
ISSN :
1520-6149
Print_ISBN :
0-7803-7041-4
Type :
conf
DOI :
10.1109/ICASSP.2001.941045
Filename :
941045
Link To Document :
بازگشت