• DocumentCode
    3425145
  • Title

    A study of JEMA for intonation modeling

  • Author

    Aguero, P.D. ; Bonafonte, Antonio

  • Author_Institution
    Fac. of Eng., Univ. of Mar del Plata, Mar del Plata
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    4625
  • Lastpage
    4628
  • Abstract
    In the literature many intonation models are trained using parameters extracted sentence-by-sentence on contours interpolated in the unvoiced segments. This may introduce a bias in the final parameters and a reduction of the generalization of the model due to the increased dispersion of them. Recently, we have proposed JEMA, a joint extraction and prediction approach for intonation modeling that avoids such assumption. The parameter extraction and model training are combined in a loop where i) the model is successively refined, and ii) the parameters are extracted using information provided by the model. In this papers we present experiments based on synthetic data to evaluate this approach in a controlled environment. Both, the results with synthetic data and with natural speech, show that the use of JEMA is clearly superior to the standard estimation approach. The parameters are correctly extracted using several degrees of missing data (0% to 80%) and Gaussian noise. In fact, the study shows that including JEMA in the training algorithm is even more relevant than the selection of a particular representation of the intonation contours, as Fujisaki, Bezier, Tilt, or others.
  • Keywords
    Gaussian noise; interpolation; natural language processing; speech processing; speech synthesis; Gaussian noise; JEMA; contours interpolation; intonation modeling; natural speech; sentence; unvoiced segments; Data mining; Frequency estimation; Mathematical model; Natural languages; Parameter estimation; Parameter extraction; Predictive models; Smoothing methods; Speech synthesis; Training data; Intonation Modeling; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518687
  • Filename
    4518687