• DocumentCode
    3147643
  • Title

    Dynamic neural network control through fuzzy Q-learning algorithms

  • Author

    Deng, Z.D. ; Kwok, D.P.

  • Author_Institution
    State Key Lab. of Intelligent Tech. & Syst., Tsinghua Univ., Beijing, China
  • Volume
    1
  • fYear
    1997
  • fDate
    28-31 Oct 1997
  • Firstpage
    381
  • Abstract
    An efficient Q-learning paradigm implemented on a fuzzy CMAC network is proposed. The fuzzy CMAC network topological architecture is described. First, the continuous states of the system are partitioned into a number of fuzzy boxes. Second, the proposed fuzzy CMAC evaluates the Q-values of agents in the fired fuzzy boxes and chooses control actions with maximum Q-values. Then a critic generates an external reinforcement signal according to the outcome or the effect of the control at every time-step, which is used later for further improving the estimation of these Q-values. To speed up the convergence of reinforcement learning, the traditional PID controller with several groups of different parameters is adopted so as to collect a number of taught-lessons. These taught-lessons together with the experienced lessons generated automatically, are sequentially replayed and learned, respectively, under the guidance of different reinforcement mechanisms. The hybrid adaptive and learning control system is applied to the control of a pH-neutralization process. Simulation investigations show that the fuzzy connectionist Q-learning control system has more adaptive, higher intelligence, and stronger generalization ability compared to neural network or fuzzy neural network control techniques using supervised learning
  • Keywords
    adaptive systems; cerebellar model arithmetic computers; fuzzy control; fuzzy neural nets; generalisation (artificial intelligence); intelligent control; learning (artificial intelligence); learning systems; neural net architecture; neurocontrollers; pH control; simulation; software agents; PID controller; adaptive ability; agents; continuous states; control actions; convergence; dynamic neural network control; efficient Q-learning paradigm; external reinforcement signal; fuzzy CMAC network; fuzzy Q-learning algorithms; fuzzy boxes; fuzzy connectionist Q-learning control system; generalization ability; hybrid adaptive/learning control system; intelligence; maximum Q-values; pH neutralization process control; reinforcement learning; simulation; taught lesson; topological architecture; Adaptive control; Automatic control; Control systems; Fuzzy control; Fuzzy neural networks; Fuzzy systems; Neural networks; Partitioning algorithms; Programmable control; Signal generators;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Processing Systems, 1997. ICIPS '97. 1997 IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    0-7803-4253-4
  • Type

    conf

  • DOI
    10.1109/ICIPS.1997.672805
  • Filename
    672805