• DocumentCode
    2158951
  • Title

    Bayesian reinforcement learning for POMDP-based dialogue systems

  • Author

    Png, ShaoWei ; Pineau, Joelle

  • Author_Institution
    Sch. of Comput. Sci., McGill Univ., Montreal, QC, Canada
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    2156
  • Lastpage
    2159
  • Abstract
    Spoken dialogue systems are gaining popularity with improvements in speech recognition technologies. Dialogue systems can be modeled effectively using POMDPs, achieving improvements in robustness. However, past research on POMDPs-based dialogue system assumes that the model parameters are known. This limitation can be addressed through model-based Bayesian reinforcement learning, which offers a rich framework for simultaneous learning and planning. However, due to the high complexity of the framework, a major challenge is to scale up these algorithms for complex dialogue systems. In this work, we show that by exploiting certain known components of the system, such as knowledge of symmetrical properties, and using an approximate online planning algorithm, we are able to apply Bayesian RL on a realistic spoken dialogue system domain.
  • Keywords
    belief networks; interactive systems; learning (artificial intelligence); speech recognition; POMDP-based dialogue system; complex dialogue system; model based Bayesian reinforcement learning; online planning; speech recognition; spoken dialogue system; Bayesian methods; Computational modeling; Learning; Markov processes; Mathematical model; Planning; Robots; Bayesian Learning; POMDPs (Partially Observable Markov Decision Processes); Reinforcement Learning; Spoken Dialogue;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5946754
  • Filename
    5946754