Title :
SVM-Based Tree-Type Neural Networks as a Critic in Adaptive Critic Designs for Control
Author :
Deb, Alok Kanti ; Jayadeva ; Gopal, Madan ; Chandra, Suresh
Author_Institution :
Indian Stat. Inst., Kolkata
fDate :
7/1/2007 12:00:00 AM
Abstract :
In this paper, we use the approach of adaptive critic design (ACD) for control, specifically, the action-dependent heuristic dynamic programming (ADHDP) method. A least squares support vector machine (SVM) regressor has been used for generating the control actions, while an SVM-based tree-type neural network (NN) is used as the critic. After a failure occurs, the critic and action are retrained in tandem using the failure data. Failure data is binary classification data, where the number of failure states are very few as compared to the number of no-failure states. The difficulty of conventional multilayer feedforward NNs in learning this type of classification data has been overcome by using the SVM-based tree-type NN, which due to its feature to add neurons to learn misclassified data, has the capability to learn any binary classification data without a priori choice of the number of neurons or the structure of the network. The capability of the trained controller to handle unforeseen situations is demonstrated.
Keywords :
adaptive control; dynamic programming; feedforward neural nets; heuristic programming; learning (artificial intelligence); least squares approximations; neurocontrollers; pattern classification; support vector machines; SVM; action-dependent heuristic dynamic programming; adaptive critic designs; binary classification data; failure data; least squares support vector machine regressor; multilayer feedforward neural nets; trained controller; tree-type neural networks; Adaptive control; Classification tree analysis; Dynamic programming; Least squares methods; Neural networks; Neurons; Programmable control; Regression tree analysis; Support vector machine classification; Support vector machines; Adaptive control; adaptive critic designs (ACDs); intelligent control; inverted pendulum; linear programming; neural network (NN) applications; support vector machines (SVMs); Algorithms; Computer Simulation; Decision Support Techniques; Feedback; Models, Theoretical; Neural Networks (Computer); Programming, Linear;
Journal_Title :
Neural Networks, IEEE Transactions on
DOI :
10.1109/TNN.2007.899255