• DocumentCode
    3034229
  • Title

    Diphone synthesis for phonetic vocoding

  • Author

    Schwartz, B. ; Klovstad, J. ; Makhoul, J. ; Klatt, D. ; Zue, V.

  • Author_Institution
    Bolt Beranek and Newman, Inc., Cambridge, Mass.
  • Volume
    4
  • fYear
    1979
  • fDate
    28946
  • Firstpage
    891
  • Lastpage
    894
  • Abstract
    We report on the synthesis of speech in the context of a phonetic vocoder operating at 100 b/s. With each phoneme, the vocoder transmits the duration and a single pitch value. The synthesizer uses a large inventory of diphone "models" to synthesize a desired phoneme string. The diphone inventory has been selected to differentiate between prevocalic and postvocalic allophones of sonorants, to account for changes in vowel color conditioned by postvocalic liquids, to allow exact specification of voice onset time, and to permit synthesis of glottal stops alveolar flaps and syllabic consonants. The diphones are extracted from carefully constructed short utterances and are stored as a sequence of LPC parameters. During synthesis, the requisite diphone models are time-warped, abutted and smoothed to produce a complete sequence of LPC parameters that are used in the synthesis. The algorithms used are described and compared with more conventional methods. Examples of the synthesized speech will be played.
  • Keywords
    Assembly systems; Fasteners; Interpolation; Joining processes; Linear predictive coding; Liquids; Speech synthesis; Steady-state; Synthesizers; Vocoders;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '79.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1979.1170600
  • Filename
    1170600