مرکز منطقه ای اطلاع رساني علوم و فناوري - Intrinsic Dimensionality Predicts the Saliency of Natural Dynamic Scenes

DocumentCode :

1348389

Title :

Intrinsic Dimensionality Predicts the Saliency of Natural Dynamic Scenes

Author :

Vig, Eleonora ; Dorr, Michael ; Martinetz, Thomas ; Barth, Erhardt

Author_Institution :

Inst. for Neuroand Bioinf., Univ. of Lubeck, Lubeck, Germany

Volume :

Issue :

fYear :

2012

fDate :

6/1/2012 12:00:00 AM

Firstpage :

1080

Lastpage :

1091

Abstract :

Since visual attention-based computer vision applications have gained popularity, ever more complex, biologically inspired models seem to be needed to predict salient locations (or interest points) in naturalistic scenes. In this paper, we explore how far one can go in predicting eye movements by using only basic signal processing, such as image representations derived from efficient coding principles, and machine learning. To this end, we gradually increase the complexity of a model from simple single-scale saliency maps computed on grayscale videos to spatiotemporal multiscale and multispectral representations. Using a large collection of eye movements on high-resolution videos, supervised learning techniques fine-tune the free parameters whose addition is inevitable with increasing complexity. The proposed model, although very simple, demonstrates significant improvement in predicting salient locations in naturalistic videos over four selected baseline models and two distinct data labeling scenarios.

Keywords :

computer vision; image representation; image resolution; iris recognition; learning (artificial intelligence); natural scenes; video signal processing; biologically inspired models; data labeling; eye movement prediction; grayscale videos; high-resolution videos; image coding principles; image representations; intrinsic dimensionality; machine learning; natural dynamic scenes saliency; naturalistic scenes; naturalistic videos; signal processing; single-scale saliency maps; spatiotemporal multiscale representations; spatiotemporal multispectral representations; supervised learning techniques; visual attention-based computer vision applications; Biological system modeling; Computational modeling; Feature extraction; Image color analysis; Predictive models; Videos; Visualization; Computational models of vision; computer vision; eye movement prediction; interest point detection.; intrinsic dimension; spatiotemporal saliency; video analysis; visual attention; Algorithms; Eye Movements; Humans; Pattern Recognition, Visual; Principal Component Analysis; Video Recording; Vision, Ocular; Visual Perception;

fLanguage :

English

Journal_Title :

Pattern Analysis and Machine Intelligence, IEEE Transactions on

Publisher :

ieee

ISSN :

0162-8828

Type :

jour

DOI :

10.1109/TPAMI.2011.198

Filename :

6042873

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1348389