• DocumentCode
    763597
  • Title

    The AT&T spoken language understanding system

  • Author

    Gupta, Narendra ; Tur, Gokhan ; Hakkani-Tur, Dilek ; Bangalore, Srinivas ; Riccardi, Giuseppe ; Gilbert, Mazin

  • Author_Institution
    AT&T Labs.-Res., USA
  • Volume
    14
  • Issue
    1
  • fYear
    2006
  • Firstpage
    213
  • Lastpage
    222
  • Abstract
    Spoken language understanding (SLU) aims at extracting meaning from natural language speech. Over the past decade, a variety of practical goal-oriented spoken dialog systems have been built for limited domains. SLU in these systems ranges from understanding predetermined phrases through fixed grammars, extracting some predefined named entities, extracting users´ intents for call classification, to combinations of users´ intents and named entities. In this paper, we present the SLU system of VoiceTone® (a service provided by AT&T where AT&T develops, deploys and hosts spoken dialog applications for enterprise customers). The SLU system includes extracting both intents and the named entities from the users´ utterances. For intent determination, we use statistical classifiers trained from labeled data, and for named entity extraction we use rule-based fixed grammars. The focus of our work is to exploit data and to use machine learning techniques to create scalable SLU systems which can be quickly deployed for new domains with minimal human intervention. These objectives are achieved by 1) using the predicate-argument representation of semantic content of an utterance; 2) extending statistical classifiers to seamlessly integrate hand crafted classification rules with the rules learned from data; and 3) developing an active learning framework to minimize the human labeling effort for quickly building the classifier models and adapting them to changes. We present an evaluation of this system using two deployed applications of VoiceTone®.
  • Keywords
    feature extraction; learning (artificial intelligence); natural languages; pattern classification; speech processing; statistical analysis; AT&T spoken language understanding system; VoiceTone; active learning framework; call classification; goal-oriented spoken dialog systems; intent determination; machine learning techniques; named entity extraction; natural language speech; rule-based fixed grammars; statistical classifiers; Computer science; Control systems; Data mining; Humans; Labeling; Machine learning; Natural languages; Packaging machines; Postal services; Speech; Named entities; semantic classification; semantic representation;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TSA.2005.854085
  • Filename
    1561278