• DocumentCode
    1857730
  • Title

    The (non)utility of linguistic features for predicting prominence in spontaneous speech

  • Author

    Brenier, J.M. ; Nenkova, A. ; Kothari, A. ; Whitton, L. ; Beaver, D. ; Jurafsky, D.

  • Author_Institution
    Dept. of Linguistics, Stanford Univ., Stanford, CA
  • fYear
    2006
  • fDate
    10-13 Dec. 2006
  • Firstpage
    54
  • Lastpage
    57
  • Abstract
    Conversational speech is characterized by prosodic variability which makes pitch accent prediction for this genre especially difficult. The linguistic literature points out that complex features such as information status, contrast and animacy help predict pitch accent placement. In this paper, we use a corpus annotated for such features to determine if they improve prominence prediction over traditional shallow features such as frequency and part-of-speech, or over new ones that we introduce. We demonstrate that while correlated with prominence, complex linguistic features do not improve prediction accuracy. Furthermore, the performance of our classifier is quite close to the ceiling defined by variability in human accent placement. An oracle experiment demonstrates, though, that at least some accuracy improvement is still possible.
  • Keywords
    linguistics; speech processing; conversational speech; human accent placement; linguistic features; pitch accent prediction; spontaneous speech; Accuracy; Concrete; Frequency; Guidelines; Humans; Probability; Robustness; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop, 2006. IEEE
  • Conference_Location
    Palm Beach
  • Print_ISBN
    1-4244-0872-5
  • Type

    conf

  • DOI
    10.1109/SLT.2006.326815
  • Filename
    4123360