DocumentCode
730744
Title
Atom decomposition-based intonation modelling
Author
Honnet, Pierre-Edouard ; Gerazov, Branislav ; Garner, Philip N.
Author_Institution
Idiap Research Institute, Martigny, Switzerland
fYear
2015
fDate
19-24 April 2015
Firstpage
4744
Lastpage
4748
Abstract
Current statistical parametric text-to-speech (TTS) synthesis methods allow production of neutral speech with acceptable quality. However, prosody is often qualified as unsatisfactory and sounding too flat. In this paper, we address intonation modelling for TTS based on physiological aspects of prosody production. A set of gamma distribution shaped atoms is defined and then intonation decomposition is performed using a matching pursuit algorithm. Some preliminary experiments show that this model allows easy extraction of physiologically meaningful atoms that could be used to generate intonation in a TTS system.
Keywords
decomposition; gamma distribution; speech enhancement; speech processing; speech synthesis; statistical analysis; atom decomposition-based intonation modelling; gamma distribution shaped atoms; intonation decomposition; matching pursuit algorithm; neutral speech production; prosody production; statistical parametric text-to-speech synthesis method; Atomic layer deposition; Matching pursuit algorithms; Intonation modelling; matching pursuit; physiology; text-to-speech synthesis
fLanguage
English
Publisher
IEEE
Conference_Titel
2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7178871
Filename
7178871