• DocumentCode
    1135121
  • Title

    Semiautomatic Improvements of System-Initiative Spoken Dialog Applications Using Interactive Clustering

  • Author

    Yu, Dong ; Acero, Alex

  • Author_Institution
    Speech Res. Group, Microsoft Res., Redmond, WA, USA
  • Volume
    13
  • Issue
    5
  • fYear
    2005
  • Firstpage
    661
  • Lastpage
    671
  • Abstract
    While many successful spoken dialog systems have been deployed over telephone networks in recent years, the high cost of developing such applications has led to limited adoption. Despite large research efforts in user-initiative and mixed-initiative systems, most commercial applications follow a system initiative approach because they are simpler to design and are found to work adequately. Yet, even designing such system-initiative spoken dialog systems has proven costly when compared with simpler touchtone systems. To address this issue, we describe in this paper our efforts in building diagnostics tools to let nonexperienced speech developers write usable applications without the need for transcribing calls. Our approach consists of two steps. In the first step, we cluster calls based on Question/Answer (QA) states and transitions, analyze the success rates associated with each QA state and transition, and identify the most problematic QA states and transitions based on a criterion we call Arc Cut Gain in Success Rate (ACGSR). In the second step, we cluster calls associated with problematic QA transitions through an approach we term Interactive Clustering (IC). The purpose of this step is to automatically cluster calls that are similar to those already labeled by the developers to maximize productivity. Experiments on an internal auto-attendant application show that our approach can significantly reduce the time and effort needed to identify problems in spoken dialog applications.
  • Keywords
    interactive systems; speech processing; speech-based user interfaces; arc cut gain in success rate; call transition diagram; interactive clustering; mixed-initiative systems; semiautomatic improvements; simpler touchtone systems; speech recognition; spoken dialog systems; system-initiative spoken dialog; Automatic speech recognition; Buildings; Costs; Customer service; Data mining; Portals; Productivity; Speech analysis; Speech recognition; Telephony; Automatic analysis; call transition diagram; data mining; model-based clustering; semi-supervised clustering; speech recognition;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/TSA.2005.851876
  • Filename
    1495447