DocumentCode :
417103
Title :
Voice characteristics conversion for TTS using reverse VTLN
Author :
Eichner, Matthias ; Wolff, Matthias ; Hoffmann, Rüdiger
Author_Institution :
Dresden Univ. of Technol., Germany
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
In the past, several approaches have been proposed for voice conversion in TTS systems. Mostly, conversion is done by modification of the spectral properties and pitch to match a certain target voice. This conversion causes distortions that deteriorate the quality of the synthesized speech. In this paper we investigate a very simple and straightforward method for voice conversion. It generates a new voice from the source speaker instead of generating a certain target speaker´s voice. For application in TTS systems it is often sufficient to synthesize new voices that sound sufficiently different to be distinguishable from each other. This is done by applying a spectral warping technique that is commonly used for speaker normalization in speech recognition systems called vocal tract length normalization (VTLN). Due to the low requirements of resources this method is especially suited for embedded systems.
Keywords :
embedded systems; spectral analysis; speech recognition; speech synthesis; TTS; embedded systems; reverse VTLN; source speaker; speaker normalization; spectral warping technique; speech recognition systems; vocal tract length normalization; voice characteristics conversion; Acoustic distortion; Character recognition; Databases; Embedded system; Loudspeakers; Signal processing; Signal synthesis; Speech recognition; Speech synthesis; Synthesizers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1325911
Filename :
1325911
Link To Document :
بازگشت