DocumentCode
1937422
Title
Effect of prosodic naturalness on segmental acceptability in synthetic speech
Author
Vainio, Martti ; Jarvikivi, J. ; Werner, Stefan ; Volk, Nicholas ; Välikangas, Jarmo
Author_Institution
Depts. of Phonetics & Gen. Linguistics, Univ. of Helsinki, Finland
fYear
2002
fDate
11-13 Sept. 2002
Firstpage
143
Lastpage
146
Abstract
It is commonly agreed that one of the major goals in the development of modern text-to-speech synthesis is the improvement of prosody, especially intonation. Although high-quality intonation is an important factor on the way to more natural synthetic speech, it is seldom scrutinized empirically whether and how it affects the relative performance of other components, such as segmental synthesis. The present paper discusses two preliminary rating experiments inquiring into the relation between the naturalness of intonation and subjective segmental quality in Finnish. Experiment 1 showed that the perception of intonation is dependent on segmental quality. More crucially, Experiment 2 indicated that perceived segmental acceptability is also significantly dependent on the relative naturalness of intonation. In light of these observations, improved intonation is desirable not only for the sake of overall quality; it is also shown to improve the subjective perception of a feature as basic to synthetic speech as segmental acceptability.
Keywords
speech processing; speech synthesis; Finnish language; intonation naturalness; prosodic naturalness; segmental acceptability; segmental synthesis; subjective segmental quality; synthetic speech; text-to-speech synthesis; Modems; Multidimensional systems; Shape; Speech synthesis; Synthesizers; System testing;
fLanguage
English
Publisher
IEEE
Conference_Titel
Speech Synthesis, 2002. Proceedings of 2002 IEEE Workshop on
Print_ISBN
0-7803-7395-2
Type
conf
DOI
10.1109/WSS.2002.1224394
Filename
1224394