Title :
Genetic algorithm methods for solving the best stationary policy of finite Markov decision processes
Author :
Chin, Hubert H. ; Jafari, Ayat A.
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci, New York Inst. of Technol., Old Westbury, NY, USA
Abstract :
This paper describes a heuristic approach to solving the optimal stationary policy of the standard finite Markov decision processes (MDP). For a MDP problem, there is a so-called policy-improvement algorithm, which can be used to determine optimal policies. It starts at an arbitrary policy f0 and produces a sequence of improvements f1, f2, f3,...fk until an optimal policy is reached. In this paper, we propose to utilize the genetic algorithm method to search the best policy that can be considered as an optimal policy. The method is a three-stage cyclic process consisting of a reproduction (selection), recombination (mating), and evaluation (survival of the fittest); and lastly, to terminate the process by setting a convergent condition. The highest fitness individual presents a best policy. In conclusion, the computational advantages of using the genetic algorithm methods are discussed
Keywords :
Markov processes; decision theory; genetic algorithms; heuristic programming; best stationary policy; convergent condition; evaluation; finite Markov decision processes; genetic algorithm method; heuristic approach; mating; optimal stationary policy; policy-improvement algorithm; recombination; reproduction; selection; three-stage cyclic process; Algorithm design and analysis; Biology computing; Communication networks; Context modeling; Evolution (biology); Genetic algorithms; Optimal control; Optimization methods; Stochastic processes;
Conference_Titel :
System Theory, 1998. Proceedings of the Thirtieth Southeastern Symposium on
Conference_Location :
Morgantown, WV
Print_ISBN :
0-7803-4547-9
DOI :
10.1109/SSST.1998.660132