DocumentCode
1861726
Title
Basis iteration for reward based dimensionality reduction
Author
Sprague, Nathan
Author_Institution
Kalamazoo Coll., Kalamazoo
fYear
2007
fDate
11-13 July 2007
Firstpage
187
Lastpage
192
Abstract
We propose a linear dimensionality reduction algorithm that selectively preserves task relevant state data for control problems modeled as Markov decision processes. The algorithm works by alternating value function estimation with basis vector adaptation. The approach is demonstrated on two tasks: a toy task designed to illustrate the key concepts, and a more complex three dimensional navigation task.
Keywords
iterative methods; learning (artificial intelligence); Markov decision process; basis vector adaptation; iteration method; reward based dimensionality reduction; value function estimation; Data mining; Educational institutions; Independent component analysis; Learning; Least squares approximation; Least squares methods; Navigation; Predictive coding; Principal component analysis; Vectors; Dimensionality Reduction; Perceptual Development; Reinforcement Learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Development and Learning, 2007. ICDL 2007. IEEE 6th International Conference on
Conference_Location
London
Print_ISBN
978-1-4244-1116-0
Electronic_ISBN
978-1-4244-1116-0
Type
conf
DOI
10.1109/DEVLRN.2007.4354032
Filename
4354032
Link To Document