• DocumentCode
    2494882
  • Title

    Robot visual homing using conjugate gradient Temporal Difference learning, radial basis features and a whole image measure

  • Author

    Altahhan, Abdulrahman

  • Author_Institution
    Fac. of Inf. & Commun. Eng., Yarmouk Private Univ., Dar'a, Syria
  • fYear
    2010
  • fDate
    18-23 July 2010
  • Firstpage
    1
  • Lastpage
    10
  • Abstract
    This work describes a robot visual homing model that employs, for the first time, the conjugate gradient Temporal Difference (TD-conj) method. TD-conj was proved to be equivalent to a gradient TD method with a variable λ, denoted TD(λ_t^(conj)), when both are used with function approximation techniques. This fact is employed in the model to improve its performance. Based on visual input that is passed through a radial basis layer, the model takes advantage of the model-free interactive-learning capability of reinforcement learning (RL) by using a whole image measure to recognize the goal, without the aid of special landmarks. Therefore, unlike other models, this model refrains from artificially manipulating the environment or assuming a priori knowledge about it, two typical constraints that widely restrict the applicability of existing models in realistic scenarios. An on-policy on-line control method was used to train a set of neural networks. With the aid of variable eligibility traces, these networks approximate the agent's action-value function, allowing it to take optimal actions to reach its home. The effectiveness of the model was verified experimentally: an agent equipped with it succeeded in finding a goal location with no a priori knowledge of the environment.
  • Keywords
    conjugate gradient methods; function approximation; image recognition; learning (artificial intelligence); radial basis function networks; robot vision; conjugate gradient temporal difference learning; function approximation technique; gradient TD method; image measure; model free interactive learning; neural network; on policy online control method; radial basis feature; reinforcement learning; robot visual homing; Biological system modeling; Current measurement; Histograms; Navigation; Pixel; Robots; Visualization;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Neural Networks (IJCNN), The 2010 International Joint Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1098-7576
  • Print_ISBN
    978-1-4244-6916-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2010.5596784
  • Filename
    5596784
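
The abstract combines three ingredients: radial basis features, an on-policy on-line control method, and eligibility traces used to approximate an action-value function. The sketch below illustrates that combination on a toy problem only. It is not the paper's TD-conj method: it uses standard linear Sarsa(λ) with a fixed λ (the paper derives a variable λ from the conjugate gradient update, which the abstract does not specify), and it replaces the whole-image visual input with a scalar position in a 1-D corridor. All function names, parameters, and the environment are hypothetical and chosen for illustration.

```python
import numpy as np


def rbf_features(x, centers, width):
    """Gaussian radial basis activations for a scalar input x."""
    return np.exp(-((x - centers) ** 2) / (2.0 * width ** 2))


def epsilon_greedy(q_values, epsilon, rng):
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return int(rng.integers(len(q_values)))
    return int(np.argmax(q_values))


def sarsa_lambda(episodes=200, n_states=11, goal=10, alpha=0.05,
                 gamma=0.95, lam=0.8, epsilon=0.1, max_steps=500, seed=0):
    """Linear Sarsa(lambda) with accumulating traces on a toy 1-D corridor.

    Assumption: lam is fixed here; the paper's method varies it per step.
    """
    rng = np.random.default_rng(seed)
    centers = np.linspace(0, n_states - 1, 6)  # hypothetical RBF layout
    width = 1.5
    n_actions = 2                              # 0 = step left, 1 = step right
    w = np.zeros((n_actions, centers.size))    # one weight vector per action
    steps_per_episode = []

    for _ in range(episodes):
        s = 0
        phi = rbf_features(s, centers, width)
        a = epsilon_greedy(w @ phi, epsilon, rng)
        e = np.zeros_like(w)                   # eligibility traces
        steps = 0
        while s != goal and steps < max_steps:
            # Toy dynamics: move one cell, clipped to the corridor.
            s_next = min(max(s + (1 if a == 1 else -1), 0), n_states - 1)
            reward = 0.0 if s_next == goal else -1.0
            phi_next = rbf_features(s_next, centers, width)

            if s_next == goal:
                target = reward                # terminal: no bootstrap
                a_next = a
            else:
                a_next = epsilon_greedy(w @ phi_next, epsilon, rng)
                target = reward + gamma * (w[a_next] @ phi_next)

            delta = target - w[a] @ phi        # TD error for (s, a)
            e *= gamma * lam                   # decay all traces
            e[a] += phi                        # accumulate for taken action
            w += alpha * delta * e             # on-line weight update

            s, a, phi = s_next, a_next, phi_next
            steps += 1
        steps_per_episode.append(steps)
    return w, steps_per_episode


if __name__ == "__main__":
    weights, steps = sarsa_lambda()
    print("steps in first episode:", steps[0])
    print("steps in last episode:", steps[-1])
```

Because the action for the next state is chosen by the same ε-greedy policy that is being evaluated, the update is on-policy. A per-step λ could be substituted for the constant `lam` in the trace decay line if a rule for computing it were available.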