DocumentCode :
730744
Title :
Atom decomposition-based intonation modelling
Author :
Honnet, Pierre-Edouard ; Gerazov, Branislav ; Garner, Philip N.
Author_Institution :
Idiap Res. Inst., Martigny, Switzerland
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
4744
Lastpage :
4748
Abstract :
Current statistical parametric text-to-speech (TTS) synthesis methods allow production of neutral speech with acceptable quality. However, prosody is often qualified as unsatisfactory and sounding too flat. In this paper, we address intonation modelling for TTS based on physiological aspects of prosody production. A set of gamma distribution shaped atoms is defined and then intonation decomposition is performed using a matching pursuit algorithm. Some preliminary experiments show that this model allows easy extraction of physiologically meaningful atoms that could be used to generate intonation in a TTS system.
Keywords :
decomposition; gamma distribution; speech enhancement; speech processing; speech synthesis; statistical analysis; atom decomposition-based intonation modelling; gamma distribution shaped atoms; intonation decomposition; matching pursuit algorithm; neutral speech production; prosody production; statistical parametric text-to-speech synthesis method; Atomic layer deposition; Matching pursuit algorithms; Intonation modelling; matching pursuit; physiology; text-to-speech synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178871
Filename :
7178871
Link To Document :
بازگشت