• DocumentCode
    2838950
  • Title

    Intensity in relation to prosody organization

  • Author

    Tseng, Chiu-Yu ; Lee, Yelling

  • Author_Institution
    Inst. of Linguistics, Acad. Sinica, Taipei, Taiwan
  • fYear
    2004
  • fDate
    15-18 Dec. 2004
  • Firstpage
    217
  • Lastpage
    220
  • Abstract
    Mandarin fluent speech prosody is most significantly characterized by phrase grouping. A hierarchical prosody framework of phrase grouping has been proposed, where corresponding evidence of governing effects from the prosody organization were found in two acoustic correlates, namely, overall F0 contours and temporal allocations (Tseng, C. et al., "From Traditional Phonology to Mandarin Speech Processing", Beijing Foreign Language Teaching and Research Process, p.417-38, 2004). We present results of investigating, through corpus analyses, a third acoustic correlate, i.e., intensity, to look for corresponding evidence in relation to prosody organization as well. The specific questions raised are: (1) how the intensity pattern can be explained by the governing effect of prosody organization; (2) whether the governing effect can be used in predicting intensity distribution. We argue that the acoustic roles of speech rhythm and intensity are as much an integrated part of speech prosody as F0 contour patterns. Therefore, we conclude that in order to construct a working prosody model, all three acoustic correlates should be considered in relation to prosody organization. The conclusion is also directly applicable to TTS to improve output naturalness.
  • Keywords
    linguistics; natural languages; speech; speech processing; Mandarin fluent speech prosody; TTS; acoustic correlates; contour patterns; corpus analyses; governing effect; intensity pattern; phrase grouping; speech prosody; speech rhythm; temporal allocations; Bismuth; Constitution; Electrostatic precipitators; Humans; Labeling; Predictive models; Rhythm; Speech analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing, 2004 International Symposium on
  • Print_ISBN
    0-7803-8678-7
  • Type

    conf

  • DOI
    10.1109/CHINSL.2004.1409625
  • Filename
    1409625