Title :
An action-selection strategy insensitive to parameter-settings in reinforcement learning
Author :
Ono, Kenji ; Iwata, Kazunori ; Hayashi, Akira
Author_Institution :
Grad. Sch. of Inf. Sci., Hiroshima City Univ., Hiroshima, Japan
Abstract :
Markov decision processes are one of the most popular frameworks for reinforcement learning. The entropy of probability density functions of Markov decision processes is referred to as the stochastic complexity. The stochastic complexity is helpful for tuning the parameters of an action-selection strategy to alleviate the exploration-exploitation dilemma. In this paper, we improve an action-selection strategy to make it insensitive to parameter-settings by using the stochastic complexity. This gives better policies for alleviating the above dilemma in most parameter-settings.
Keywords :
Markov processes; entropy; learning (artificial intelligence); Markov decision processes; action-selection strategy; entropy; exploration-exploitation dilemma; parameter tuning; parameter-settings; probability density functions; reinforcement learning; stochastic complexity; Adaptive systems; Entropy; Information theory; Learning; Probability density function; Stochastic processes; Stochastic systems; Markov Decision Process; Reinforcement Learning; Softmax Method;
Conference_Titel :
ICCAS-SICE, 2009
Conference_Location :
Fukuoka
Print_ISBN :
978-4-907764-34-0
Electronic_ISBN :
978-4-907764-33-3