DocumentCode :
1870808
Title :
A Comparison of Spectral Continuity Measures as a Join Cost in Concatenative Speech Synthesis
Author :
Kirkpatrick, Barbara ; O´Brien, D. ; Scaife, Ronan
Author_Institution :
Sch. of Comput., Dublin City Univ.
fYear :
2006
fDate :
28-30 June 2006
Firstpage :
515
Lastpage :
520
Abstract :
The quality of concatenative speech synthesis depends on the cost function employed for unit selection. Cost functions for spectral continuity are difficult to define and standard measures used for this task often do not accurately reflect human perception of discontinuity across a concatenated join. We compare a set of standard distance measures for the task of detecting audible discontinuities and introduce a new measure. A perceptual experiment is described that was used to relate each measure to human perception of discontinuities. The impact of window length on feature extraction and subsequent detection of discontinuities is investigated. The distance measure approach to detecting audible discontinuities is extended to a feature space based representation and feature transformations are investigated as a means of improving discontinuity detection. Receiver operating characteristic (ROC) curves is used to compare the results, which indicate that the feature space approach improves on the performance of standard measures
Keywords :
sensitivity analysis; spectral analysis; speech synthesis; ROC curves; audible discontinuities detection; concatenative speech synthesis; cost function; human perception; receiver operating characteristic; spectral continuity measures; Speech synthesis; distance measure; join cost; perceptual listening test; statistical pattern recognition; unit selection;
fLanguage :
English
Publisher :
iet
Conference_Titel :
Irish Signals and Systems Conference, 2006. IET
Conference_Location :
Dublin
Print_ISBN :
0-86341-665-9
Type :
conf
Filename :
4123956
Link To Document :
بازگشت