Title :
Modular Neural Networks for Reinforcement Learning with Temporal Intrinsic Rewards
Author :
Takeuchi, Johane ; Shouno, Osamu ; Tsujino, Hiroshi
Author_Institution :
Honda Res. Inst. Japan Co., Ltd., Saitama
Abstract :
Inspired by intrinsic motivation, which is thought to play a crucial role in animal development and learning, several artificial learning systems with built-in intrinsic rewards have recently been studied. Here we propose an intrinsically rewarded learning system for autonomous task achievement that copes with several kinds of transitions. The system consists of neural networks equipped with a modular reinforcement learning algorithm. By decomposing the observed state space, the modular system stabilizes the intrinsic rewards, which are calculated from prediction errors. On-line learning with the proposed system takes place under various kinds of transitions, including deterministic, probabilistic, and partially observable ones, without any transition-specific parameter adjustment. Combining the modular network with the intrinsic reward generator led to performance that converged to the optimal action sequences in all tested transitions, in which external rewards were delivered only at the completion of tasks.
Keywords :
learning (artificial intelligence); neural nets; intrinsically rewarded learning system; modular neural network; online learning; optimal sequence; reinforcement learning; temporal intrinsic reward; Accelerated aging; Acceleration; Animals; Artificial neural networks; Function approximation; Learning systems; Neural networks; Predictive models; Robots; State-space methods;
Conference_Title :
Neural Networks, 2007. IJCNN 2007. International Joint Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
978-1-4244-1379-9
ISSN :
1098-7576
DOI :
10.1109/IJCNN.2007.4371120