DocumentCode :
542232
Title :
Improved duration control for speech synthesis using a multigram language model
Author :
Eichner, Matthias ; Wolff, Matthias ; Hoffmann, Rudiger
Author_Institution :
Dresden University of Technology, Laboratory of Acoustics and Speech Communication, D-01062, Germany
Volume :
1
fYear :
2002
fDate :
13-17 May 2002
Abstract :
Speech synthesis systems based on concatenation of segments derived from natural speech are very intelligible and achieve a high overall quality. Even though listeners often complain about wrong or missing temporal structure and timing in synthetic speech. We propose a new approach for duration control in speech synthesis that uses the probability of a word in its context to control the local speaking rate within the utterance. This idea bases on the observation that words that are very likely to occur in a given context are pronounced less accurate and faster than improbable ones. In this paper we introduce an algorithm that implements the duration control using a multigram language model and will present first experimental results.
Keywords :
Lead; Speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743743
Filename :
5743743
Link To Document :
بازگشت