A study of JEMA for intonation modeling

Author

Aguero, P.D. ; Bonafonte, Antonio

Author_Institution

Fac. of Eng., Univ. of Mar del Plata, Mar del Plata

fYear

2008

fDate

March 31 2008-April 4 2008

Firstpage

4625

Lastpage

4628

Abstract

In the literature many intonation models are trained using parameters extracted sentence-by-sentence on contours interpolated in the unvoiced segments. This may introduce a bias in the final parameters and a reduction of the generalization of the model due to the increased dispersion of them. Recently, we have proposed JEMA, a joint extraction and prediction approach for intonation modeling that avoids such assumption. The parameter extraction and model training are combined in a loop where i) the model is successively refined, and ii) the parameters are extracted using information provided by the model. In this papers we present experiments based on synthetic data to evaluate this approach in a controlled environment. Both, the results with synthetic data and with natural speech, show that the use of JEMA is clearly superior to the standard estimation approach. The parameters are correctly extracted using several degrees of missing data (0% to 80%) and Gaussian noise. In fact, the study shows that including JEMA in the training algorithm is even more relevant than the selection of a particular representation of the intonation contours, as Fujisaki, Bezier, Tilt, or others.

Keywords

Gaussian noise; interpolation; natural language processing; speech processing; speech synthesis; Gaussian noise; JEMA; contours interpolation; intonation modeling; natural speech; sentence; unvoiced segments; Data mining; Frequency estimation; Mathematical model; Natural languages; Parameter estimation; Parameter extraction; Predictive models; Smoothing methods; Speech synthesis; Training data; Intonation Modeling; Speech synthesis;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on

Conference_Location

Las Vegas, NV

ISSN

1520-6149

Print_ISBN

978-1-4244-1483-3

Electronic_ISBN

1520-6149

Type

conf

DOI

10.1109/ICASSP.2008.4518687

Filename

4518687