DocumentCode
2876041
Title
Detection of questions in Chinese conversational speech
Author
Yuan, Jiahong ; Jurafsky, Dan
Author_Institution
Stanford Univ., CA
fYear
2005
fDate
27-27 Nov. 2005
Firstpage
47
Lastpage
52
Abstract
What features are helpful for Chinese question detection? Which of them are more important? What are the differences between Chinese and English regarding feature importance? We study these questions by building question detectors for Chinese and English conversational speech, and performing analytic studies and feature selection experiments. As in English, we find that both textual and prosodic features are helpful for Chinese question detection. Among textual features, word identities, especially the utterance-final word, are more useful than the global (N-gram) sentence likelihood. Unlike in English, where final pitch rise is a good cue for questions, we find in Chinese that utterance final pitch behavior is not a good feature. Instead, the most useful prosodic feature is the spectral balance, i.e., the distribution of energy over the frequency spectrum, of the final syllable. We also find effects of tone, e.g., that treating interjection words as having a special tone is helpful. Our final classifier achieves an error rate of 14.9% with respect to a 50% chance-level rate
Keywords
feature extraction; natural languages; speech recognition; Chinese conversational speech; Chinese question detection; English conversational speech; N-gram sentence likelihood; feature selection; textual features; utterance-final word; word identities; Acoustic signal detection; Computer vision; Data mining; Detectors; Error analysis; Frequency; Maximum likelihood detection; Natural languages; Performance analysis; Speech analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on
Conference_Location
San Juan
Print_ISBN
0-7803-9478-X
Electronic_ISBN
0-7803-9479-8
Type
conf
DOI
10.1109/ASRU.2005.1566536
Filename
1566536
Link To Document