Stability analysis of pRAM reinforcement learning

Author

Adeodato, Paulo J L ; Taylor, John G.

Author_Institution

Dept. de Inf., Univ. Federal de Pernambuco, Recife, Brazil

fYear

1997

fDate

3-5 Dec 1997

Firstpage

41

Lastpage

50

Abstract

Generalisation has been a major issue in RAM-based neural networks. In pRAM networks generalisation is produced by noisy reinforcement learning-a completely hardware implementable (built-in) algorithm. This paper presents the first part of a modular technique to analyse the formation of the basins of attraction in such systems. It proves that reinforcement learning in a single pRAM site is a globally stable system in the continuous limit of incremental learning. It also shows how the stable state depends on the penalty/reward ratio and on the learning rate. The evolution of learning in the time domain shows the effects of the initial state and of the halting moment in the final state. The paper ends with considerations on how noise contributes to the formation of basins of attraction in pRAM neurons

Keywords

circuit stability; generalisation (artificial intelligence); learning (artificial intelligence); neural chips; noise; pattern recognition; time-domain analysis; RAM-based neural networks; basins of attraction; generalisation; neural net chip; noise; pRAM networks; pattern recognition; penalty/reward ratio; reinforcement learning; stability; time domain; Educational institutions; Learning; Neural network hardware; Neural networks; Neurons; Noise generators; Phase change random access memory; Read-write memory; Stability analysis; Working environment noise;

fLanguage

English

Publisher

ieee

Conference_Titel

Neural Networks, 1997. Proceedings., IVth Brazilian Symposium on

Conference_Location

Goiania

Print_ISBN

0-8186-8070-9

Type

conf

DOI

10.1109/SBRN.1997.645847

Filename

645847