DocumentCode
3293192
Title
Stability analysis of pRAM reinforcement learning
Author
Adeodato, Paulo J L ; Taylor, John G.
Author_Institution
Dept. de Inf., Univ. Federal de Pernambuco, Recife, Brazil
fYear
1997
fDate
3-5 Dec 1997
Firstpage
41
Lastpage
50
Abstract
Generalisation has been a major issue in RAM-based neural networks. In pRAM networks generalisation is produced by noisy reinforcement learning-a completely hardware implementable (built-in) algorithm. This paper presents the first part of a modular technique to analyse the formation of the basins of attraction in such systems. It proves that reinforcement learning in a single pRAM site is a globally stable system in the continuous limit of incremental learning. It also shows how the stable state depends on the penalty/reward ratio and on the learning rate. The evolution of learning in the time domain shows the effects of the initial state and of the halting moment in the final state. The paper ends with considerations on how noise contributes to the formation of basins of attraction in pRAM neurons
Keywords
circuit stability; generalisation (artificial intelligence); learning (artificial intelligence); neural chips; noise; pattern recognition; time-domain analysis; RAM-based neural networks; basins of attraction; generalisation; neural net chip; noise; pRAM networks; pattern recognition; penalty/reward ratio; reinforcement learning; stability; time domain; Educational institutions; Learning; Neural network hardware; Neural networks; Neurons; Noise generators; Phase change random access memory; Read-write memory; Stability analysis; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 1997. Proceedings., IVth Brazilian Symposium on
Conference_Location
Goiania
Print_ISBN
0-8186-8070-9
Type
conf
DOI
10.1109/SBRN.1997.645847
Filename
645847
Link To Document