DocumentCode :
2775999
Title :
A reinforcement learning solution for the unit commitment problem
Author :
Coronado, Carla A. ; Figueroa, Marcelo R. ; Roa-Sepulveda, Claudio A.
fYear :
2012
fDate :
4-7 Sept. 2012
Firstpage :
1
Lastpage :
6
Abstract :
This work proposes a solution method for the unit commitment (UC) problem based on reinforcement learning (RL). The UC problem is an optimization problem whose objective is to minimize the operational cost of the electrical power system. Solving a UC problem yields an operating schedule for a set of generation units that satisfies the unit operational constraints and the system demand. This paper considers a four-state definition (maximum, minimum, banking and off) for the thermal generation units. Under this definition, the UC model is verified to have the Markov property, which allows the RL method to be applied and the UC problem to be solved by means of rewards. These rewards are derived from the unit operational costs, the system demand and the electrical system constraints. A two-step search algorithm is proposed that considers both the subsequent state and the state after it, allowing the learning agent to evaluate every alternative efficiently and choose the best one available. Finally, the performance of the method is measured on a 10-unit system, and the results demonstrate its effectiveness.
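The two-step search described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation, and it contains no learning step: it only shows how a reward built from unit costs and demand can be evaluated over the next state and the state after it. The two-unit system, the output and cost figures, `START_COST`, `PENALTY`, and the function names `reward` and `two_step_schedule` are all hypothetical, and demand is assumed to be satisfied whenever committed output covers it.

```python
from itertools import product

# The four unit states named in the abstract.
STATES = ("max", "min", "bank", "off")

# Hypothetical units: state -> (output in MW, hourly cost). Not from the paper.
UNITS = [
    {"max": (100, 50.0), "min": (40, 25.0), "bank": (0, 5.0), "off": (0, 0.0)},
    {"max": (60, 40.0), "min": (20, 18.0), "bank": (0, 4.0), "off": (0, 0.0)},
]
START_COST = 30.0  # assumed cost charged when a unit leaves the "off" state
PENALTY = 1000.0   # assumed penalty per MW of unmet demand


def reward(prev, cur, demand):
    """Return the negative cost of moving from joint state prev to cur."""
    output = sum(unit[s][0] for unit, s in zip(UNITS, cur))
    cost = sum(unit[s][1] for unit, s in zip(UNITS, cur))
    cost += sum(START_COST for p, c in zip(prev, cur) if p == "off" and c != "off")
    cost += PENALTY * max(0, demand - output)
    return -cost


def two_step_schedule(demands, start):
    """Pick each hour's joint state by looking one extra hour ahead."""
    joint_states = list(product(STATES, repeat=len(UNITS)))
    schedule, prev = [], start
    for t, demand in enumerate(demands):

        def score(cur):
            # Reward of the subsequent state plus the best reward reachable
            # from it in the following hour, when there is one.
            total = reward(prev, cur, demand)
            if t + 1 < len(demands):
                total += max(reward(cur, nxt, demands[t + 1]) for nxt in joint_states)
            return total

        prev = max(joint_states, key=score)
        schedule.append(prev)
    return schedule
```

Calling `two_step_schedule([50, 150, 30], ("off", "off"))` returns one tuple of unit states per hour. Enumerating every joint state is only practical for a toy system like this one; a 10-unit system has 4^10 joint states per hour, which is one reason a learned value estimate is attractive.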
Keywords :
thermal power stations; electrical power system operational cost; reinforcement learning solution; thermal generation units; two-step algorithm; unit commitment problem; Banking; Biological system modeling; Genetic algorithms; Learning; Linear programming; Mathematical model; Optimization; Markovian processes; Reinforcement learning; Unit commitment;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Universities Power Engineering Conference (UPEC), 2012 47th International
Conference_Location :
London
Print_ISBN :
978-1-4673-2854-8
Electronic_ISBN :
978-1-4673-2855-5
Type :
conf
DOI :
10.1109/UPEC.2012.6398663
Filename :
6398663