• DocumentCode
    2489930
  • Title

    Multi-resolution state-space discretization for Q-Learning with pseudo-randomized discretization

  • Author

    Lampton, Amanda ; Valasek, John ; Kumar, Mrinal

  • Author_Institution
    Dept. of Aerosp. Eng., Texas A&M Univ., College Station, TX, USA
  • fYear
    2010
  • fDate
    18-23 July 2010
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    A multi-resolution state-space discretization method with pseudo-random griding is developed for the episodic unsupervised learning method of Q-Learning. It is used as the learning agent for closed-loop control of morphing or highly reconfigurable systems. This paper develops a method whereby a state-space is adaptively discretized by progressively finer pseudo-random grids around the Regions Of Interest within the state or learning space in an effort to break the Curse of Dimensionality. Utility of the method is demonstrated with application to the problem of a morphing airfoil, which is simulated by a computationally intensive computational fluid dynamics model. By setting the multi-resolution method to define the Region Of Interest by the goal the agent seeks, it is shown that this method with the pseudo-random grid can learn a specific goal within ±0.001, while reducing the total number of state-action pairs needed to achieve this level of specificity to less than 3000.
  • Keywords
    aerodynamics; multi-agent systems; unsupervised learning; Q-Learning; closed-loop control; computational fluid dynamics model; curse of dimensionality; episodic unsupervised learning method; learning agent; multiresolution state-space discretization method; pseudo-random griding; pseudo-randomized discretization; regions of interest; Artificial neural networks; Atmospheric modeling; Automotive components;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), The 2010 International Joint Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1098-7576
  • Print_ISBN
    978-1-4244-6916-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2010.5596516
  • Filename
    5596516