مرکز منطقه ای اطلاع رساني علوم و فناوري - Hierachical learning of visual invariances via spatio-temporal constraints

DocumentCode :

1621485

Title :

Hierachical learning of visual invariances via spatio-temporal constraints

Author :

Stone, J.V.

Author_Institution :

Sussex Univ., Brighton, UK

fYear :

1995

Firstpage :

110

Lastpage :

115

Abstract :

An unsupervised learning algorithm is presented for the acquisition of low-level vision tasks, such as the extraction of surface depth. A key assumption is that perceptually salient parameters in the image of a moving object vary smoothly over time. This assumption is consistent with a learning rule which maximises the long-term variance of a unit´s outputs, whilst simultaneously minimising its short-term variance. This learning rule involves a linear combination of anti-Hebbian and Hebbian weight changes, over short and long time scales respectively. It can be shown that the learning rule maximises the mutual information between a unit´s output and the short-term exponentially weighted mean of that unit´s output. The model is demonstrated on a hyper-acuity task: estimating sub-pixel stereo disparity from a temporal sequence of stereograms. The model can discriminate disparities as small as one-tenth of the width of an input receptor, which compares with human performance on stereo hyper-acuity tasks. Furthermore, the algorithm generalises, without additional learning, to previously unseen image sequences. The results of applying the learning method independently to successive layers of a hierarchical network are presented

Keywords :

Hebbian learning; computer vision; generalisation (artificial intelligence); hierarchical systems; image sequences; invariance; neural nets; spatial reasoning; stereo image processing; temporal reasoning; unsupervised learning; Hebbian weight changes; antiHebbian weight changes; generalization; hierachical learning; hierarchical network; hyper-acuity task; input receptor width; low-level vision task acquisition; moving object image; mutual information; short-term exponentially weighted mean; smoothly varying perceptually salient parameters; spatiotemporal constraints; stereogram temporal sequence; sub-pixel stereo disparity estimation; surface depth extraction; unit output long-term variance maximization; unit output short-term variance minimization; unseen image sequences; unsupervised learning algorithm; visual invariances;

fLanguage :

English

Publisher :

iet

Conference_Titel :

Artificial Neural Networks, 1995., Fourth International Conference on

Conference_Location :

Cambridge

Print_ISBN :

0-85296-641-5

Type :

conf

DOI :

10.1049/cp:19950538

Filename :

497800

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1621485