• DocumentCode
    3521608
  • Title

    A diphone synthesis system based on time-domain prosodic modifications of speech

  • Author

    Hamon, Christian ; Mouline, E. ; Charpentier, Francis

  • Author_Institution
    CNET, Lannion, France
  • fYear
    1989
  • fDate
    23-26 May 1989
  • Firstpage
    238
  • Abstract
    A novel time-domain algorithm is presented for text-to-speech synthesis using diphone concatenation. The algorithm is based on the pitch-synchronous overlap-add (PSOLA) approach and is capable of good quality prosodic modifications of natural speech. The algorithm can be seen as a simplification of a previous algorithm combining the PSOLA approach and frequency-domain transformations. On the other hand, it appears as a generalization of previous time-domain methods that perform pitch synchronous cut-and-splice operations on the speech waveform. This algorithm is used in the CNET diphone synthesis multilingual system, actually supporting three languages: French, Italian, and German. The resulting speech has been tested on French and is judged of much better quality than for an LPC-based synthesizer
  • Keywords
    speech synthesis; CNET diphone synthesis multilingual system; French; German; Italian; PSOLA; diphone concatenation; diphone synthesis system; frequency-domain transformations; natural speech; pitch synchronous cut-and-splice operations; pitch-synchronous overlap-add; speech waveform; text-to-speech synthesis; time-domain algorithm; time-domain prosodic modifications; Concatenated codes; Distortion; Frequency domain analysis; Natural languages; Signal synthesis; Speech synthesis; Synthesizers; Testing; Time domain analysis; Wideband;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on
  • Conference_Location
    Glasgow
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.1989.266409
  • Filename
    266409