Title :
The (non)utility of linguistic features for predicting prominence in spontaneous speech
Author :
Brenier, J.M. ; Nenkova, A. ; Kothari, A. ; Whitton, L. ; Beaver, D. ; Jurafsky, D.
Author_Institution :
Dept. of Linguistics, Stanford Univ., Stanford, CA
Abstract :
Conversational speech is characterized by prosodic variability which makes pitch accent prediction for this genre especially difficult. The linguistic literature points out that complex features such as information status, contrast and animacy help predict pitch accent placement. In this paper, we use a corpus annotated for such features to determine if they improve prominence prediction over traditional shallow features such as frequency and part-of-speech, or over new ones that we introduce. We demonstrate that while correlated with prominence, complex linguistic features do not improve prediction accuracy. Furthermore, the performance of our classifier is quite close to the ceiling defined by variability in human accent placement. An oracle experiment demonstrates, though, that at least some accuracy improvement is still possible.
Keywords :
linguistics; speech processing; conversational speech; human accent placement; linguistic features; pitch accent prediction; spontaneous speech; Accuracy; Concrete; Frequency; Guidelines; Humans; Probability; Robustness; Speech synthesis;
Conference_Titel :
Spoken Language Technology Workshop, 2006. IEEE
Conference_Location :
Palm Beach
Print_ISBN :
1-4244-0872-5
DOI :
10.1109/SLT.2006.326815