Title :
Building Adaptive Dialogue Systems Via Bayes-Adaptive POMDPs
Author :
Shaowei Png ; Pineau, Joelle ; Chaib-draa, B.
Author_Institution :
Sch. of Comput. Sci., McGill Univ., Montreal, QC, Canada
Abstract :
Recent research has shown that effective dialogue management can be achieved through the Partially Observable Markov Decision Process (POMDP) framework. However past research on POMDP-based dialogue systems usually assumed the parameters of the decision process were known a priori. The main contribution of this paper is to present a Bayesian reinforcement learning framework for learning the POMDP parameters online from data, in a decision-theoretic manner. We discuss various approximations and assumptions which can be leveraged to ensure computational tractability, and apply these techniques to learning observation models for several simulated spoken dialogue domains.
Keywords :
Bayes methods; Markov processes; decision theory; interactive systems; learning (artificial intelligence); speech recognition; Bayesian reinforcement learning; POMDP; adaptive dialogue system; computational tractability; decision theory; dialogue management; partially observable Markov decision process; spoken dialogue domain; Bayesian methods; Learning; Markov processes; Speech recognition; User interfaces; Bayesian inference; Dialogue management; Markov decision process (MDP); partially observable Markov decision process (POMDP); reinforcement learning;
Journal_Title :
Selected Topics in Signal Processing, IEEE Journal of
DOI :
10.1109/JSTSP.2012.2229962