DocumentCode :
1609321
Title :
A Policy-Improving System with a Mixture of Bayesian Networks Adapting Agents to Continuously Changing Environments
Author :
Kitakoshi, Daisuke ; Shioya, Hiroyuki ; Nakano, Ryohei
Author_Institution :
Graduate Sch. of Eng., Nagoya Inst. of Technol.
fYear :
2006
Firstpage :
6031
Lastpage :
6036
Abstract :
A variety of adaptive learning systems which adapt themselves to complicated environments has been studied and developed in the broad field of AI researches. For example, many reinforcement learning (RL) methods have been proposed to adapt agents to the environments. At the same time, Bayesian network (BN), one of the stochastic models, has attracted increasing attention due to its noise robustness, reasoning power, etc. We have proposed a system improving RL agents´ policies with a mixture model of RNs, and have evaluated the adapting performance of our system. Each structure of BN can be regarded as a stochastic knowledge representation in the policy acquired through RL. It has been confirmed that the agent with our system could improve their policies by the information derived from the mixture, and then could adequately adapt to dynamically-switched environments. In this research, we propose a method to appropriately normalize mixing parameters of the mixture for the use in common adaptive learning systems, and evaluate the fundamental performance of our system in continuously-changing environment
Keywords :
adaptive systems; belief networks; learning systems; adaptive learning systems; continuously changing environments; dynamically switched environments; mixture of Bayesian networks; policy-improving system; reinforcement learning; stochastic knowledge representation; stochastic models; Adaptive systems; Artificial intelligence; Bayesian methods; Knowledge representation; Learning systems; Noise robustness; Power system modeling; Stochastic resonance; Stochastic systems; Working environment noise; Adapting to continuously-changing environments; Mixture of Bayesian networks; Reinforcement learing; Stochastic knowledge representation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
SICE-ICASE, 2006. International Joint Conference
Conference_Location :
Busan
Print_ISBN :
89-950038-4-7
Electronic_ISBN :
89-950038-5-5
Type :
conf
DOI :
10.1109/SICE.2006.315202
Filename :
4108659
Link To Document :
بازگشت