Title :
Shape-invariant pitch-synchronous text-to-speech conversion
Author :
Banga, Eduardo R. ; García-Mateo, Carmen
Author_Institution :
ETSI Telecomunicacion, Vigo Univ., Spain
Abstract :
Text-to-speech (T-T-S) systems based on the concatenation of speech units need a prosodic modification algorithm to adjust the prosodic features of the stored speech units to the desired output values. We discuss the application of a sinusoidal shape-invariant model to a T-T-S system for Spanish, paying special attention to the concatenation issues and phase treatment. The resulting speech waveform resembles the waveform of its contributory units, without sounding reverberant as in other sinusoidal implementations
Keywords :
natural languages; speech coding; speech synthesis; synchronisation; Spanish; phase treatment; prosodic features; prosodic modification algorithm; shape-invariant pitch-synchronous conversion; sinusoidal implementations; sinusoidal shape-invariant model; speech coding; speech units concatenation; speech waveform; stored speech units; text-to-speech conversion; Data mining; Frequency domain analysis; Humans; Reverberation; Shape; Speech analysis; Speech processing; Speech synthesis; Telecommunication standards; Time domain analysis; Vocoders;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479683