A diphone synthesis system based on time-domain prosodic modifications of speech

Author

Hamon, Christian ; Mouline, E. ; Charpentier, Francis

Author_Institution

CNET, Lannion, France

fYear

1989

fDate

23-26 May 1989

Firstpage

238

Abstract

A novel time-domain algorithm is presented for text-to-speech synthesis using diphone concatenation. The algorithm is based on the pitch-synchronous overlap-add (PSOLA) approach and is capable of good quality prosodic modifications of natural speech. The algorithm can be seen as a simplification of a previous algorithm combining the PSOLA approach and frequency-domain transformations. On the other hand, it appears as a generalization of previous time-domain methods that perform pitch synchronous cut-and-splice operations on the speech waveform. This algorithm is used in the CNET diphone synthesis multilingual system, actually supporting three languages: French, Italian, and German. The resulting speech has been tested on French and is judged of much better quality than for an LPC-based synthesizer

Keywords

speech synthesis; CNET diphone synthesis multilingual system; French; German; Italian; PSOLA; diphone concatenation; diphone synthesis system; frequency-domain transformations; natural speech; pitch synchronous cut-and-splice operations; pitch-synchronous overlap-add; speech waveform; text-to-speech synthesis; time-domain algorithm; time-domain prosodic modifications; Concatenated codes; Distortion; Frequency domain analysis; Natural languages; Signal synthesis; Speech synthesis; Synthesizers; Testing; Time domain analysis; Wideband;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on

Conference_Location

Glasgow

ISSN

1520-6149

Type

conf

DOI

10.1109/ICASSP.1989.266409

Filename

266409