• DocumentCode
    2498534
  • Title

    On learning with imperfect representations

  • Author

    Kalyanakrishnan, Shivaram ; Stone, Peter

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Texas at Austin, Austin, TX, USA
  • fYear
    2011
  • fDate
    11-15 April 2011
  • Firstpage
    17
  • Lastpage
    24
  • Abstract
    In this paper we present a perspective on the relationship between learning and representation in sequential decision making tasks. We undertake a brief survey of existing real-world applications, which demonstrates that the classical “tabular” representation seldom applies in practice. Specifically, several practical tasks suffer from state aliasing, and most demand some form of generalization and function approximation. Coping with these representational aspects thus becomes an important direction for furthering the advent of reinforcement learning in practice. The central thesis we present in this position paper is that in practice, learning methods specifically developed to work with imperfect representations are likely to perform better than those developed for perfect representations and then applied in imperfect-representation settings. We specify an evaluation criterion for learning methods in practice, and propose a framework for their synthesis. In particular, we highlight the degrees of “representational bias” prevalent in different learning methods. We reference a variety of relevant literature as a background for this introspective essay.
  • Keywords
    decision making; function approximation; learning (artificial intelligence); function approximation; imperfect representation setting; reinforcement learning; representational bias degree; sequential decision making tasks; state aliasing; Approximation algorithms; Computational modeling; Decision making; Function approximation; Learning systems; Least squares approximation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Adaptive Dynamic Programming And Reinforcement Learning (ADPRL), 2011 IEEE Symposium on
  • Conference_Location
    Paris
  • Print_ISBN
    978-1-4244-9887-1
  • Type

    conf

  • DOI
    10.1109/ADPRL.2011.5967379
  • Filename
    5967379