Title :
A Comprehensive Reinforcement Learning Framework for Dialogue Management Optimization
Author :
Daubigney, Lucie ; Geist, Matthieu ; ChandraMohan, S. ; Pietquin, Olivier
Author_Institution :
IMS-MaLIS Res. Group, Metz, France
Abstract :
Reinforcement learning is now an acknowledged approach for optimizing the interaction strategy of spoken dialogue systems. While the first algorithms considered were quite basic (like SARSA), recent work has concentrated on more sophisticated methods. More attention has been paid to off-policy learning, the exploration-exploitation dilemma, sample efficiency, and the handling of non-stationarity. New algorithms have been proposed to address these issues and have been applied to dialogue management. However, each algorithm often solves a single issue at a time, while dialogue systems exhibit all of these problems at once. In this paper, we propose to apply the Kalman Temporal Differences (KTD) framework to the problem of dialogue strategy optimization so as to address all these issues in a comprehensive manner with a single framework. Our claims are illustrated by experiments conducted on two real-world goal-oriented dialogue management frameworks, DIPPER and HIS.
Keywords :
interactive systems; learning (artificial intelligence); DIPPER goal-oriented dialogue management frameworks; HIS goal-oriented dialogue management frameworks; KTD framework; Kalman temporal differences framework; comprehensive reinforcement learning framework; dialogue management optimization; dialogue strategy optimization; exploration-exploitation dilemma; off-policy learning; spoken dialogue systems; Heuristic algorithms; Kalman filters; Learning; Markov processes; Signal processing algorithms; Dialogue management; reinforcement learning; spoken dialogue system;
Journal_Title :
IEEE Journal of Selected Topics in Signal Processing
DOI :
10.1109/JSTSP.2012.2229257