DocumentCode :
2877018
Title :
A real-time French text-to-speech system generating high-quality synthetic speech
Author :
Moulines, E. ; Emerard, F. ; Larreur, D. ; Le Saint Milon, J. L. ; Le Faucheur, L. ; Marty, F. ; Charpentier, F. ; Sorin, C.
Author_Institution :
CNET LAA/TSS/RCP, Lannion, France
fYear :
1990
fDate :
3-6 Apr 1990
Firstpage :
309
Abstract :
The main features of the CNET diphone-based text-to-speech system for the French language are described. The linguistic analysis works in three steps. First, a morphosyntactic analysis module assigns a grammatical value to each word in the text and transcribes it phonetically. A second module parses the text into hierarchical syntactico-prosodic groups. Finally, prosodic patterns are automatically assigned to each word by queries to a database of prosodic events. The phonetic and prosodic information serves as commands to the synthesis component, which is based on diphone concatenation. A time-domain formulation of the pitch-synchronous overlap-add scheme (TD-PSOLA) is used to modify the speech prosody and to concatenate diphone waveforms. It is combined with a low bit-rate speech decoder to reduce the memory required to store the diphone inventory. The system runs in real time on a PC equipped with a TMS320C25 DSP board and provides notably improved sound quality and naturalness in comparison to commercially available systems.
Keywords :
computerised signal processing; decoding; real-time systems; speech synthesis; CNET; TMS320C25 DSP board; diphone concatenation; diphone-based system; grammatical value; hierarchical syntactico-prosodic groups; high-quality synthetic speech; linguistic analysis; low bit-rate speech decoder; morphosyntactic analysis module; phonetic information; pitch-synchronous overlap-add scheme; prosodic patterns; real-time French text-to-speech system; sound quality; time-domain formulation; Computer science education; Databases; Decoding; Digital signal processing; Information analysis; Laboratories; Natural languages; Real time systems; Speech coding; Speech synthesis; Time domain analysis;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on
Conference_Location :
Albuquerque, NM
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.1990.115650
Filename :
115650