Title :
Dynamic neural network control through fuzzy Q-learning algorithms
Author :
Deng, Z.D. ; Kwok, D.P.
Author_Institution :
State Key Lab. of Intelligent Tech. & Syst., Tsinghua Univ., Beijing, China
Abstract :
An efficient Q-learning paradigm implemented on a fuzzy CMAC network is proposed. The fuzzy CMAC network topological architecture is described. First, the continuous states of the system are partitioned into a number of fuzzy boxes. Second, the proposed fuzzy CMAC evaluates the Q-values of agents in the fired fuzzy boxes and chooses control actions with maximum Q-values. Then a critic generates an external reinforcement signal according to the outcome or the effect of the control at every time-step, which is used later for further improving the estimation of these Q-values. To speed up the convergence of reinforcement learning, the traditional PID controller with several groups of different parameters is adopted so as to collect a number of taught-lessons. These taught-lessons together with the experienced lessons generated automatically, are sequentially replayed and learned, respectively, under the guidance of different reinforcement mechanisms. The hybrid adaptive and learning control system is applied to the control of a pH-neutralization process. Simulation investigations show that the fuzzy connectionist Q-learning control system has more adaptive, higher intelligence, and stronger generalization ability compared to neural network or fuzzy neural network control techniques using supervised learning
Keywords :
adaptive systems; cerebellar model arithmetic computers; fuzzy control; fuzzy neural nets; generalisation (artificial intelligence); intelligent control; learning (artificial intelligence); learning systems; neural net architecture; neurocontrollers; pH control; simulation; software agents; PID controller; adaptive ability; agents; continuous states; control actions; convergence; dynamic neural network control; efficient Q-learning paradigm; external reinforcement signal; fuzzy CMAC network; fuzzy Q-learning algorithms; fuzzy boxes; fuzzy connectionist Q-learning control system; generalization ability; hybrid adaptive/learning control system; intelligence; maximum Q-values; pH neutralization process control; reinforcement learning; simulation; taught lesson; topological architecture; Adaptive control; Automatic control; Control systems; Fuzzy control; Fuzzy neural networks; Fuzzy systems; Neural networks; Partitioning algorithms; Programmable control; Signal generators;
Conference_Titel :
Intelligent Processing Systems, 1997. ICIPS '97. 1997 IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
0-7803-4253-4
DOI :
10.1109/ICIPS.1997.672805