DocumentCode :
1749051
Title :
Temporal differences learning with the conjugate gradient algorithm
Author :
Falas, Tasos ; Stafylopatis, Andreas-Georgios
Author_Institution :
Nat. Tech. Univ. of Athens, Greece
Volume :
1
fYear :
2001
fDate :
2001
Firstpage :
171
Abstract :
This paper investigates the use of the conjugate gradient (CG) algorithm, in comparison to the traditional backpropagation (BP) algorithm, as applied to the temporal difference (TD) method for reinforcement learning. Time series prediction is the application domain examined. Simple time series, as well as more complex ones drawn from real data (stock market indices), are used as benchmark problems. The performance measures used are learning speed, generalization ability, and sensitivity to user-set parameters. Preliminary experimental results suggest that the performance of TD learning can be significantly improved when the CG algorithm is employed instead of the traditional BP algorithm. In addition, as expected, the CG algorithm proved to be more robust and less dependent on user-set training parameters and initial conditions, especially for rather complicated time series. The use of the CG algorithm in TD learning is therefore promising for real-life applications in time series prediction.
Keywords :
conjugate gradient methods; generalisation (artificial intelligence); learning (artificial intelligence); neural nets; optimisation; time series; conjugate gradient algorithm; generalization; neural networks; reinforcement learning; temporal difference learning; Backpropagation algorithms; Character generation; Educational institutions; Learning systems; Robustness; Signal generators; Signal processing; Stock markets; Supervised learning
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Neural Networks, 2001. Proceedings. IJCNN '01. International Joint Conference on
Conference_Location :
Washington, DC
ISSN :
1098-7576
Print_ISBN :
0-7803-7044-9
Type :
conf
DOI :
10.1109/IJCNN.2001.939012
Filename :
939012