• DocumentCode
    555155
  • Title

    Implementing autonomous shaping by critical states

  • Author

    Jiong Song ; Zhao Jin

  • Author_Institution
    Yunnan Jiao Tong Vocational & Tech. Coll., Kunming, China
  • Volume
    1
  • fYear
    2011
  • fDate
    20-22 Aug. 2011
  • Firstpage
    276
  • Lastpage
    279
  • Abstract
    Shaping is a powerful method for speeding up reinforcement learning, but the major drawback that shaping reward depends on external observer limits its application and requires significant effort. We implement an autonomous shaping reinforcement learning method by making agent can discover autonomously critical states from prior experience and use them to shape later learning. The critical state is a state that has high probability to exist in all these acyclic state trajectories that from the start state to the goal state, that means, if agent wants to reach the goal state, then it would have high likelihood to pass the critical states. So the critical states can be used to shape agent for reaching the goal state faster. The experiments on Maze problem show our method can significant improve agent´s performance. The more important is we make agent can shape its later learning by its prior experience.
  • Keywords
    learning (artificial intelligence); probability; Maze problem; acyclic state trajectory; agent learning; agent performance; autonomous critical state discovery; autonomous shaping; external observer; probability; reinforcement learning; Learning; Learning systems; Machine learning; Observers; Shape; Training; Trajectory; critical State; prior experience; reinforcement learning; shaping; speeding up learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Technology and Artificial Intelligence Conference (ITAIC), 2011 6th IEEE Joint International
  • Conference_Location
    Chongqing
  • Print_ISBN
    978-1-4244-8622-9
  • Type

    conf

  • DOI
    10.1109/ITAIC.2011.6030203
  • Filename
    6030203