DocumentCode :
2970981
Title :
The exploration/exploitation trade-off in Reinforcement Learning for dialogue management
Author :
Varges, Sebastian ; Riccardi, Giuseppe ; Quarteroni, Silvia ; Ivanov, Alexei V.
Author_Institution :
Dept. of Inf. Eng. & Comput. Sci., Univ. of Trento, Povo di Trento, Italy
fYear :
2009
fDate :
Nov. 13 2009-Dec. 17 2009
Firstpage :
479
Lastpage :
484
Abstract :
Conversational systems use deterministic rules that trigger actions such as requests for confirmation or clarification. More recently, reinforcement learning and (partially observable) Markov decision processes have been proposed for this task. In this paper, we investigate action selection strategies for dialogue management, in particular the exploration/exploitation trade-off and its impact on final reward (i.e. the session reward after optimization has ended) and lifetime reward (i.e. the overall reward accumulated over the learner´s lifetime). We propose to use interleaved exploitation sessions as a learning methodology to assess the reward obtained from the current policy. The experiments show a statistically significant difference in final reward of exploitation-only sessions between a system that optimizes lifetime reward and one that maximizes the reward of the final policy.
Keywords :
Markov processes; interactive systems; learning (artificial intelligence); speech recognition; Markov decision process; action selection; conversational system; deterministic rule; dialogue management; exploration-exploitation trade-off; lifetime reward; reinforcement learning; session reward; speech recognition; Computer science; Delta modulation; Engineering management; Humans; Machine learning; Noise level; Noise robustness; Speech recognition; Supervised learning; Uncertainty;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location :
Merano
Print_ISBN :
978-1-4244-5478-5
Electronic_ISBN :
978-1-4244-5479-2
Type :
conf
DOI :
10.1109/ASRU.2009.5373260
Filename :
5373260
Link To Document :
بازگشت