• DocumentCode
    3585053
  • Title

    The use of discriminative belief tracking in POMDP-based dialogue systems

  • Author

    Dongho Kim ; Henderson, Matthew ; Gasic, Milica ; Tsiakoulis, Pirros ; Young, Steve

  • Author_Institution
    Dept. of Eng., Univ. of Cambridge, Cambridge, UK
  • fYear
    2014
  • Firstpage
    354
  • Lastpage
    359
  • Abstract
    Statistical spoken dialogue systems based on Partially Observable Markov Decision Processes (POMDPs) have been shown to be more robust to speech recognition errors by maintaining a belief distribution over multiple dialogue states and making policy decisions based on the entire distribution rather than the single most likely hypothesis. To date most POMDP-based systems have used generative trackers. However, concerns about modelling accuracy have created interest in discriminative methods, and recent results from the second Dialog State Tracking Challenge (DSTC2) have shown that discriminative trackers can significantly outperform generative models in terms of tracking accuracy. The aim of this paper is to investigate the extent to which these improvements translate into improved task completion rates when incorporated into a spoken dialogue system. To do this, the Recurrent Neural Network (RNN) tracker described by Henderson et al in DSTC2 was integrated into the Cambridge statistical dialogue system and compared with the existing generative Bayesian network tracker. Using a Gaussian Process (GP) based policy, the experimental results indicate that the system using the RNN tracker performs significantly better than the system with the original Bayesian network tracker.
  • Keywords
    Gaussian processes; Markov processes; interactive systems; recurrent neural nets; speech recognition; statistical analysis; Cambridge statistical dialogue system; DSTC2; GP based policy; Gaussian process based policy; POMDP-based dialogue systems; RNN tracker; belief distribution; dialog state tracking challenge; discriminative belief tracking; multiple dialogue states; partially observable Markov decision processes; policy decision making; recurrent neural network tracker; statistical spoken dialogue systems; Accuracy; Error analysis; Gaussian processes; Recurrent neural networks; Speech recognition; Training; Vectors; POMDP; belief tracking; dialogue management; recurrent neural networks; spoken dialogue systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop (SLT), 2014 IEEE
  • Type

    conf

  • DOI
    10.1109/SLT.2014.7078600
  • Filename
    7078600