Title :
Temporal differences learning with the conjugate gradient algorithm
Author :
Falas, Tasos ; Stafylopatis, Andreas-Georgios
Author_Institution :
Nat. Tech. Univ. of Athens, Greece
Abstract :
This paper investigates the use of the conjugate gradient (CG) algorithm, in comparison to the traditional backpropagation (BP) algorithm, as applied to the temporal difference (TD) method for reinforcement learning. Time series prediction is the application domain examined. Simple time series, as well as more complex ones derived from real data (stock market indices), are used as benchmark problems. The performance measures used are learning speed, generalization ability, and sensitivity to user-set parameters. Preliminary experimental results suggest that the performance of TD learning can be significantly improved when the CG algorithm is employed instead of the traditional BP algorithm. In addition, as expected, the CG algorithm proved to be more robust and less dependent on user-set training parameters and initial conditions, especially for rather complicated time series. The use of the CG algorithm in TD learning is therefore promising for real-life applications in time series prediction.
Keywords :
conjugate gradient methods; generalisation (artificial intelligence); learning (artificial intelligence); neural nets; optimisation; time series; conjugate gradient algorithm; generalization; neural networks; reinforcement learning; temporal difference learning; Backpropagation algorithms; Character generation; Educational institutions; Learning systems; Robustness; Signal generators; Signal processing; Stock markets; Supervised learning
Conference_Title :
Neural Networks, 2001. Proceedings. IJCNN '01. International Joint Conference on
Conference_Location :
Washington, DC
Print_ISBN :
0-7803-7044-9
DOI :
10.1109/IJCNN.2001.939012