DocumentCode
3425145
Title
A study of JEMA for intonation modeling
Author
Aguero, P.D. ; Bonafonte, Antonio
Author_Institution
Fac. of Eng., Univ. of Mar del Plata, Mar del Plata
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
4625
Lastpage
4628
Abstract
In the literature many intonation models are trained using parameters extracted sentence-by-sentence on contours interpolated in the unvoiced segments. This may introduce a bias in the final parameters and a reduction of the generalization of the model due to the increased dispersion of them. Recently, we have proposed JEMA, a joint extraction and prediction approach for intonation modeling that avoids such assumption. The parameter extraction and model training are combined in a loop where i) the model is successively refined, and ii) the parameters are extracted using information provided by the model. In this papers we present experiments based on synthetic data to evaluate this approach in a controlled environment. Both, the results with synthetic data and with natural speech, show that the use of JEMA is clearly superior to the standard estimation approach. The parameters are correctly extracted using several degrees of missing data (0% to 80%) and Gaussian noise. In fact, the study shows that including JEMA in the training algorithm is even more relevant than the selection of a particular representation of the intonation contours, as Fujisaki, Bezier, Tilt, or others.
Keywords
Gaussian noise; interpolation; natural language processing; speech processing; speech synthesis; Gaussian noise; JEMA; contours interpolation; intonation modeling; natural speech; sentence; unvoiced segments; Data mining; Frequency estimation; Mathematical model; Natural languages; Parameter estimation; Parameter extraction; Predictive models; Smoothing methods; Speech synthesis; Training data; Intonation Modeling; Speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4518687
Filename
4518687
Link To Document