DocumentCode
3521608
Title
A diphone synthesis system based on time-domain prosodic modifications of speech
Author
Hamon, Christian ; Mouline, E. ; Charpentier, Francis
Author_Institution
CNET, Lannion, France
fYear
1989
fDate
23-26 May 1989
Firstpage
238
Abstract
A novel time-domain algorithm is presented for text-to-speech synthesis using diphone concatenation. The algorithm is based on the pitch-synchronous overlap-add (PSOLA) approach and is capable of good quality prosodic modifications of natural speech. The algorithm can be seen as a simplification of a previous algorithm combining the PSOLA approach and frequency-domain transformations. On the other hand, it appears as a generalization of previous time-domain methods that perform pitch synchronous cut-and-splice operations on the speech waveform. This algorithm is used in the CNET diphone synthesis multilingual system, actually supporting three languages: French, Italian, and German. The resulting speech has been tested on French and is judged of much better quality than for an LPC-based synthesizer
Keywords
speech synthesis; CNET diphone synthesis multilingual system; French; German; Italian; PSOLA; diphone concatenation; diphone synthesis system; frequency-domain transformations; natural speech; pitch synchronous cut-and-splice operations; pitch-synchronous overlap-add; speech waveform; text-to-speech synthesis; time-domain algorithm; time-domain prosodic modifications; Concatenated codes; Distortion; Frequency domain analysis; Natural languages; Signal synthesis; Speech synthesis; Synthesizers; Testing; Time domain analysis; Wideband;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on
Conference_Location
Glasgow
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.1989.266409
Filename
266409
Link To Document