• DocumentCode
    1660289
  • Title

    Modeling the intensity of syllables using classification and Regression Trees

  • Author

    Vempada, Ramu Reddy ; Rao, K. Sreenivasa

  • Author_Institution
    Sch. of Inf. Technol., Indian Inst. of Technol. Kharagpur, Kharagpur, India
  • fYear
    2012
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    The quality of the synthesized speech of text-to-speech (TTS) synthesis systems can be improved by controlling the intensities of speech segments in addition to other prosodic features such as intonation and duration. In this paper we proposed Classification and Regression Tree (CART) to model the intensities of syllables. Positional, contextual and phonological features associated to syllables are proposed to model the intensities. The proposed CART model is evaluated by means of objective measures such as average prediction error (μ), standard deviation (σ), correlation coefficient (γX,Y) and the percentage of syllables predicted within different deviations. From the studies we find that 82% of the syllable intensities could be predicted from the models within 7% deviation. The contribution of individual features in modeling the syllable intensities is also analysed. The proposed model is also evaluated by means of subjective listening tests on the synthesized speech generated by incorporating the predicted syllable intensities.
  • Keywords
    pattern classification; prediction theory; regression analysis; speech intelligibility; speech synthesis; CART; classification and regression tree; correlation coefficient; phonological features; predicted syllable intensities; prediction error; speech segments intensities; standard deviation; synthesized speech generation; synthesized speech quality; text-to-speech synthesis systems; Computational modeling; Context modeling; Pragmatics; Predictive models; Regression tree analysis; Speech; Speech synthesis; CART; Contextual; Intelligibility; MOS; Naturalness; Objective; Phonological; Positional; Subjective; TTS;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications (NCC), 2012 National Conference on
  • Conference_Location
    Kharagpur
  • Print_ISBN
    978-1-4673-0815-1
  • Type

    conf

  • DOI
    10.1109/NCC.2012.6176824
  • Filename
    6176824