• DocumentCode
    2876041
  • Title

    Detection of questions in Chinese conversational speech

  • Author

    Yuan, Jiahong ; Jurafsky, Dan

  • Author_Institution
    Stanford Univ., CA
  • fYear
    2005
  • fDate
    27-27 Nov. 2005
  • Firstpage
    47
  • Lastpage
    52
  • Abstract
    What features are helpful for Chinese question detection? Which of them are more important? What are the differences between Chinese and English regarding feature importance? We study these questions by building question detectors for Chinese and English conversational speech, and performing analytic studies and feature selection experiments. As in English, we find that both textual and prosodic features are helpful for Chinese question detection. Among textual features, word identities, especially the utterance-final word, are more useful than the global (N-gram) sentence likelihood. Unlike in English, where final pitch rise is a good cue for questions, we find in Chinese that utterance final pitch behavior is not a good feature. Instead, the most useful prosodic feature is the spectral balance, i.e., the distribution of energy over the frequency spectrum, of the final syllable. We also find effects of tone, e.g., that treating interjection words as having a special tone is helpful. Our final classifier achieves an error rate of 14.9% with respect to a 50% chance-level rate
  • Keywords
    feature extraction; natural languages; speech recognition; Chinese conversational speech; Chinese question detection; English conversational speech; N-gram sentence likelihood; feature selection; textual features; utterance-final word; word identities; Acoustic signal detection; Computer vision; Data mining; Detectors; Error analysis; Frequency; Maximum likelihood detection; Natural languages; Performance analysis; Speech analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on
  • Conference_Location
    San Juan
  • Print_ISBN
    0-7803-9478-X
  • Electronic_ISBN
    0-7803-9479-8
  • Type

    conf

  • DOI
    10.1109/ASRU.2005.1566536
  • Filename
    1566536