Title :
Learning to Relate Images
Author :
Memisevic, Roland
Author_Institution :
Dept. of Comput. Sci. & Oper. Res., Univ. of Montreal, Montreal, QC, Canada
Abstract :
A fundamental operation in many vision tasks, including motion understanding, stereopsis, visual odometry, and invariant recognition, is establishing correspondences between images or between images and data from other modalities. Recently, there has been increasing interest in learning to infer correspondences from data using relational, spatiotemporal, and bilinear variants of deep learning methods. These methods use multiplicative interactions between pixels or between features to represent correlation patterns across multiple images. In this paper, we review recent work on relational feature learning and analyze the role that multiplicative interactions play in learning to encode relations. We also discuss how square-pooling and complex cell models can be viewed as representing multiplicative interactions and, thereby, as encoding relations.
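Illustrative_Sketch :
The abstract's two ideas, multiplicative interactions between filter responses and square-pooling as an implicit form of those interactions, can be sketched in a few lines of NumPy. The sizes, weight names (U, V, W, A), and nonlinearity below are illustrative assumptions, not the paper's actual parameterization or training procedure.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: D pixels per patch, F factors, K mapping units.
D, F, K = 64, 32, 16

x = rng.standard_normal(D)   # patch from image 1
y = rng.standard_normal(D)   # patch from image 2 (e.g., a transformed copy of x)

# Factored bilinear (gated) model: U and V filter each image separately,
# and W pools products of the filter responses into mapping-unit activations.
U = rng.standard_normal((D, F))
V = rng.standard_normal((D, F))
W = rng.standard_normal((F, K))

# Multiplicative interactions: element-wise products of filter responses
# capture how x and y are related (a cross-correlation between the patches).
factors = (U.T @ x) * (V.T @ y)
mappings = np.tanh(W.T @ factors)        # relational code for the pair (x, y)

# Square-pooling / complex-cell (energy model) view: filter the concatenated
# pair and square the responses.  Expanding (a + b)**2 produces the cross
# term 2*a*b, so squaring contains the same multiplicative interactions.
A = rng.standard_normal((2 * D, F))
z = np.concatenate([x, y])
pooled = W.T @ (A.T @ z) ** 2            # energy-model-style relational code
```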
Keywords :
computer vision; correlation methods; image coding; inference mechanisms; learning (artificial intelligence); spatiotemporal phenomena; bilinear deep-learning method; complex cell model; correlation pattern representation; image features; image pixels; inference framework; multiplicative interaction representation; relation encoding; relational deep-learning method; relational feature learning; spatiotemporal deep-learning method; square-pooling model; vision tasks; Computational modeling; Image recognition; Learning systems; Logic gates; Mathematical model; Standards; Training; Learning image relations; complex cells; energy models; mapping units; spatiotemporal features; Algorithms; Artificial Intelligence; Humans; Pattern Recognition, Automated; Pattern Recognition, Visual;
Journal_Title :
IEEE Transactions on Pattern Analysis and Machine Intelligence
DOI :
10.1109/TPAMI.2013.53