Title :
Efficient computing budget allocation for simulation-based policy improvement
Author_Institution :
Dept. of Autom., Tsinghua Univ., Beijing, China
Abstract :
The dynamics of many systems in nowadays follow not only physical laws but also man-made rules. These systems are known as discrete event dynamic systems (DEDS´s). Policy improvement of such systems is usually based on simulation, which is time consuming and provides only noisy performance evaluation. It is of great practical interest to understand how to allocate the computing budget among action candidates so that a better policy is found with high probability. Despite the abundant studies on simulation-based policy optimization, few consider this allocation problem. This paper considers this important problem. Based on the method of optimal computing budget allocation (OCBA) in simulation-based optimization, an efficient allocation procedure is developed, which is shown to asymptotically maximize a lower bound of the probability of correctly selecting the best action. This allocation procedure is compared with equal allocation, which is well adopted in practice, on numerical examples. The numerical results show that even when there are only finite computing budget to allocate, this OCBA-based allocation procedure outperforms equal allocation and has good performance. We hope this work brings insight to computing budget allocation for simulation-based policy improvement in more general situations.
Keywords :
budgeting; discrete event systems; optimisation; probability; OCBA; budget allocation; optimal computing budget allocation; probability; simulation-based optimization; simulation-based policy improvement; Computational modeling; Dynamic programming; Function approximation; Markov processes; Optimization; Resource management; Discrete event dynamic system; optimal computing budget allocation; simulation-based policy improvement;
Conference_Titel :
Control Conference (ASCC), 2011 8th Asian
Conference_Location :
Kaohsiung
Print_ISBN :
978-1-61284-487-9
Electronic_ISBN :
978-89-956056-4-6