• DocumentCode
    2294526
  • Title

    The Ant(λ) ant colony optimization algorithm based on eligibility trace

  • Author

    Wang, Xiao-Rong ; Wu, Tie-Jun

  • Author_Institution
    Nat. Lab. of Ind. Control Technol., Zhejiang Univ., Hangzhou, China
  • Volume
    5
  • fYear
    2003
  • fDate
    5-8 Oct. 2003
  • Firstpage
    4065
  • Abstract
    The pheromone-based parameterized probabilistic model for the ACO algorithm is presented as the construction graph that the combinatorial optimization problem can be mapped on. Based on the construction graph, the solution construction procedure and update rule of pheromone model in the ACO algorithm is illustrated. The finite deterministic Markov decision process corresponding to the solution construction procedure is illustrated in the terminology of reinforcement learning (RL) theory. The ACO algorithms are fitted into the framework of generalized policy iteration (GPl) in RL based on incomplete information of the Markov state. Furthermore, we show that the pheromone update in the ACS and Ant-Q algorithm is based on the MC methods or some formalistic combination of MC methods and TD methods. TD methods have usually been found to converge faster than MC methods in many applications, but works worse than the MC method in the non-Markov environment. We propose a novel ACO algorithm, Ant(λ) algorithm, which introduces the eligibility trace mechanism into the local update procedure of pheromone, the algorithm unifies the TD method and MC method mathematically, and in the algorithm, the delayed reinforcement can be back propagated in time.
  • Keywords
    Markov processes; convergence; decision making; evolutionary computation; graph theory; iterative methods; learning (artificial intelligence); optimisation; probability; Markov decision process; ant colony optimization algorithm; combinatorial optimization problem; construction graph; convergence; eligibility trace mechanism; evolutionary computation; generalized policy iteration; parameterized probabilistic model; pheromone model; reinforcement learning theory; Ant colony optimization; Cities and towns; Costs; Decision making; Delay effects; Intelligent systems; Learning; Monte Carlo methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man and Cybernetics, 2003. IEEE International Conference on
  • ISSN
    1062-922X
  • Print_ISBN
    0-7803-7952-7
  • Type

    conf

  • DOI
    10.1109/ICSMC.2003.1245624
  • Filename
    1245624