• DocumentCode
    730744
  • Title
    Atom decomposition-based intonation modelling
  • Author
    Honnet, Pierre-Edouard; Gerazov, Branislav; Garner, Philip N.
  • Author_Institution
    Idiap Research Institute, Martigny, Switzerland
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    4744
  • Lastpage
    4748
  • Abstract
    Current statistical parametric text-to-speech (TTS) synthesis methods can produce neutral speech of acceptable quality. However, the prosody is often judged unsatisfactory and too flat. In this paper, we address intonation modelling for TTS based on physiological aspects of prosody production. A set of gamma-distribution-shaped atoms is defined, and intonation decomposition is then performed using a matching pursuit algorithm. Preliminary experiments show that this model allows easy extraction of physiologically meaningful atoms that could be used to generate intonation in a TTS system.
  • Keywords
    decomposition; gamma distribution; speech enhancement; speech processing; speech synthesis; statistical analysis; atom decomposition-based intonation modelling; gamma distribution shaped atoms; intonation decomposition; matching pursuit algorithm; neutral speech production; prosody production; statistical parametric text-to-speech synthesis method; Atomic layer deposition; Matching pursuit algorithms; Intonation modelling; matching pursuit; physiology; text-to-speech synthesis
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD, Australia
  • Type
    conf
  • DOI
    10.1109/ICASSP.2015.7178871
  • Filename
    7178871
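
The abstract describes decomposing intonation into gamma-distribution-shaped atoms with a matching pursuit algorithm. The following is a minimal sketch of that general idea, not the authors' implementation. The function names `gamma_atom` and `matching_pursuit`, the atom length, the shape and scale values, and the synthetic test contour are all assumptions chosen for illustration; the record does not give the paper's actual parameters.

```python
import numpy as np

def gamma_atom(length, k, theta):
    """Unit-norm atom shaped like a gamma pdf (shape k, scale theta in samples).
    The parameter values used below are illustrative assumptions."""
    t = np.arange(1, length + 1, dtype=float)
    g = t ** (k - 1) * np.exp(-t / theta)
    return g / np.linalg.norm(g)

def matching_pursuit(signal, atoms, n_iter):
    """Greedy matching pursuit over shifted copies of the given atoms.
    Returns a list of (atom index, start sample, amplitude) and the residual.
    Every atom must be no longer than the signal."""
    residual = np.asarray(signal, dtype=float).copy()
    picks = []
    for _ in range(n_iter):
        best = None
        for i, atom in enumerate(atoms):
            # Inner product of the residual with the atom at every valid shift
            corr = np.correlate(residual, atom, mode="valid")
            pos = int(np.argmax(np.abs(corr)))
            if best is None or abs(corr[pos]) > abs(best[2]):
                best = (i, pos, float(corr[pos]))
        i, pos, amp = best
        # Remove the selected atom's contribution from the residual
        residual[pos:pos + len(atoms[i])] -= amp * atoms[i]
        picks.append(best)
    return picks, residual

# Synthetic contour (hypothetical data): two non-overlapping scaled atoms
atoms = [gamma_atom(50, 2.0, 5.0), gamma_atom(50, 3.0, 4.0)]
contour = np.zeros(250)
contour[20:70] += 2.0 * atoms[0]
contour[150:200] -= 1.0 * atoms[1]

picks, residual = matching_pursuit(contour, atoms, n_iter=2)
for idx, start, amp in picks:
    print(f"atom {idx} at sample {start}, amplitude {amp:.3f}")
```

In a real setting the input would presumably be a pitch contour extracted from speech rather than a synthetic signal, and the stopping rule would more likely depend on the residual energy than on a fixed iteration count.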