• DocumentCode
    3293192
  • Title

    Stability analysis of pRAM reinforcement learning

  • Author

    Adeodato, Paulo J L ; Taylor, John G.

  • Author_Institution
    Dept. de Inf., Univ. Federal de Pernambuco, Recife, Brazil
  • fYear
    1997
  • fDate
    3-5 Dec 1997
  • Firstpage
    41
  • Lastpage
    50
  • Abstract
    Generalisation has been a major issue in RAM-based neural networks. In pRAM networks generalisation is produced by noisy reinforcement learning-a completely hardware implementable (built-in) algorithm. This paper presents the first part of a modular technique to analyse the formation of the basins of attraction in such systems. It proves that reinforcement learning in a single pRAM site is a globally stable system in the continuous limit of incremental learning. It also shows how the stable state depends on the penalty/reward ratio and on the learning rate. The evolution of learning in the time domain shows the effects of the initial state and of the halting moment in the final state. The paper ends with considerations on how noise contributes to the formation of basins of attraction in pRAM neurons
  • Keywords
    circuit stability; generalisation (artificial intelligence); learning (artificial intelligence); neural chips; noise; pattern recognition; time-domain analysis; RAM-based neural networks; basins of attraction; generalisation; neural net chip; noise; pRAM networks; pattern recognition; penalty/reward ratio; reinforcement learning; stability; time domain; Educational institutions; Learning; Neural network hardware; Neural networks; Neurons; Noise generators; Phase change random access memory; Read-write memory; Stability analysis; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 1997. Proceedings., IVth Brazilian Symposium on
  • Conference_Location
    Goiania
  • Print_ISBN
    0-8186-8070-9
  • Type

    conf

  • DOI
    10.1109/SBRN.1997.645847
  • Filename
    645847