DocumentCode
555155
Title
Implementing autonomous shaping by critical states
Author
Jiong Song ; Zhao Jin
Author_Institution
Yunnan Jiao Tong Vocational & Tech. Coll., Kunming, China
Volume
1
fYear
2011
fDate
20-22 Aug. 2011
Firstpage
276
Lastpage
279
Abstract
Shaping is a powerful method for speeding up reinforcement learning, but the major drawback that shaping reward depends on external observer limits its application and requires significant effort. We implement an autonomous shaping reinforcement learning method by making agent can discover autonomously critical states from prior experience and use them to shape later learning. The critical state is a state that has high probability to exist in all these acyclic state trajectories that from the start state to the goal state, that means, if agent wants to reach the goal state, then it would have high likelihood to pass the critical states. So the critical states can be used to shape agent for reaching the goal state faster. The experiments on Maze problem show our method can significant improve agent´s performance. The more important is we make agent can shape its later learning by its prior experience.
Keywords
learning (artificial intelligence); probability; Maze problem; acyclic state trajectory; agent learning; agent performance; autonomous critical state discovery; autonomous shaping; external observer; probability; reinforcement learning; Learning; Learning systems; Machine learning; Observers; Shape; Training; Trajectory; critical State; prior experience; reinforcement learning; shaping; speeding up learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Technology and Artificial Intelligence Conference (ITAIC), 2011 6th IEEE Joint International
Conference_Location
Chongqing
Print_ISBN
978-1-4244-8622-9
Type
conf
DOI
10.1109/ITAIC.2011.6030203
Filename
6030203
Link To Document