Title :
A reinforcement learning algorithm to improve scheduling search heuristics with the SVM
Author :
Gersmann, Kai ; Hammer, Barbara
Author_Institution :
Dept. of Math. & Comput. Sci., Osnabruck Univ., Germany
Abstract :
A regret-based biased random sampling scheme (RBRS) is a simple but powerful priority-rule based method to solve the resource constrained project scheduling problem (RCPSP), a well-known NP-hard benchmark problem. We present a generic machine learning method to improve results of RBRS. The rout-algorithm of reinforcement learning is combined with the support vector machine (SVM) to learn an appropriate value function which guides the search strategy given by RBRS. The specific properties of the SVM allow to reduce the size of the training set and show improved results even after a short period of training as demonstrated for benchmark instances of the RCPSP.
Keywords :
computational complexity; function approximation; learning (artificial intelligence); optimisation; random processes; sampling methods; scheduling; search problems; support vector machines; NP hard benchmark problem; SVM; generic machine learning method; regret based biased random sampling; reinforcement learning algorithm; resource constrained project scheduling problem; rout algorithm; rule based method; search heuristic scheduling; support vector machine; training set; value function approximation; Function approximation; Iterative algorithms; Learning systems; Machine learning; NASA; Payloads; Sampling methods; Scheduling algorithm; Space shuttles; Support vector machines;
Conference_Titel :
Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
Print_ISBN :
0-7803-8359-1
DOI :
10.1109/IJCNN.2004.1380883