• DocumentCode
    806736
  • Title

    An Integrative and Discriminative Technique for Spoken Utterance Classification

  • Author

    Yaman, Sibel ; Deng, Li ; Yu, Dong ; Wang, Ye-Yi ; Acero, Alex

  • Author_Institution
    Microsoft Corp., Redmond, WA
  • Volume
    16
  • Issue
    6
  • fYear
    2008
  • Firstpage
    1207
  • Lastpage
    1214
  • Abstract
    Traditional methods of spoken utterance classification (SUC) adopt two independently trained phases. In the first phase, an automatic speech recognition (ASR) module returns the most likely sentence for the observed acoustic signal. In the second phase, a semantic classifier transforms the resulting sentence into the most likely semantic class. Since the two phases are isolated from each other, such traditional SUC systems are suboptimal. In this paper, we present a novel integrative and discriminative learning technique for SUC to alleviate this problem, and thereby, reduce the semantic classification error rate (CER). Our approach revolves around the effective use of the N-best lists generated by the ASR module to reduce semantic classification errors. The N-best list sentences are first rescored using all the available knowledge sources. Then, the sentence that is most likely to helps reduce the CER are extracted from the N-best lists as well as those sentences that are most likely to increase the CER. These sentences are used to discriminatively train the language and semantic-classifier models to minimize the overall semantic CER. Our experiments resulted in a reduction of CER from its initial value of 4.92% to 4.04% in the standard ATIS task.
  • Keywords
    error statistics; signal classification; speech recognition; acoustic signal; automatic speech recognition; classification error rate; discriminative technique; integrative technique; semantic-classifier models; spoken utterance classification; Automatic speech recognition; Command and control systems; Error analysis; Natural languages; Routing; Speech processing; Speech recognition; Automatic speech recognition (ASR); discriminative training; spoken language understanding (SLU); spoken utterance classification (SUC); statistical language modeling;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2008.2001106
  • Filename
    4566089