مرکز منطقه ای اطلاع رساني علوم و فناوري - A real-time French text-to-speech system generating high-quality synthetic speech

DocumentCode :

2877018

Title :

A real-time French text-to-speech system generating high-quality synthetic speech

Author :

Moulines, E. ; Emerard, F. ; Larreur, D. ; Milon, J. L Le Saint ; Faucheur, L. Le ; Marty, F. ; Charpentier, F. ; Sorin, C.

Author_Institution :

CNET LAA/TSS/RCP, Lannion, France

fYear :

1990

fDate :

3-6 Apr 1990

Firstpage :

309

Abstract :

The main features of the CNET diphone-based text-to-speech system for French language are described. The linguistic analysis works in three steps. First, a morphosyntactic analysis module assigns a grammatical value to each word in the text and transcribes it phonetically. A second module parses the text into hierarchical syntactico-prosodic groups. Finally, prosodic patterns are automatically assigned to each word by queries to a database of prosodic events. The phonetic and prosodic information serves as commands to the synthesis component. The synthesis component is based on diphone concatenation. A time-domain formulation of the pitch-synchronous overlap-add scheme (TD-PSOLA) is used to modify the speech prosody and to concatenate diphone waveforms. It is combined with a low bit-rate speech decoder to reduce the memory requirement for storing the diphone inventory. The system runs in real time on a PC equipped with a TMS320C25 DSP board and provides notably improved sound quality and naturalness in comparison to commercially available systems

Keywords :

computerised signal processing; decoding; real-time systems; speech synthesis; CNET; TMS320C25 DSP board; diphone concatenation; diphone-based system; grammatical value; hierarchical syntactico-prosodic groups; high-quality synthetic speech; linguistic analysis; low bit-rate speech decoder; morphosyntactic analysis module; phonetic information; pitch-synchronous overlap-add scheme; prosodic patterns; real-time French text-to-speech system; sound quality; time-domain formulation; Computer science education; Databases; Decoding; Digital signal processing; Information analysis; Laboratories; Natural languages; Real time systems; Speech coding; Speech synthesis; Time domain analysis;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on

Conference_Location :

Albuquerque, NM

ISSN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.1990.115650

Filename :

115650

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2877018