Title :
Finding the Secret of Image Saliency in the Frequency Domain
Author :
Jia Li ; Ling-Yu Duan ; Xiaowu Chen ; Tiejun Huang ; Yonghong Tian
Author_Institution :
State Key Lab. of Virtual Reality Technol. & Syst., Beihang Univ., Beijing, China
Abstract :
There are two sides to every story of visual saliency modeling in the frequency domain. On the one hand, image saliency can be effectively estimated by applying simple operations to the frequency spectrum. On the other hand, it is still unclear which part of the frequency spectrum contributes the most to popping-out targets and suppressing distractors. Toward this end, this paper tentatively explores the secret of image saliency in the frequency domain. From the results obtained in several qualitative and quantitative experiments, we find that the secret of visual saliency may mainly hide in the phases of intermediate frequencies. To explain this finding, we reinterpret the concept of discrete Fourier transform from the perspective of template-based contrast computation and thus develop several principles for designing the saliency detector in the frequency domain. Following these principles, we propose a novel approach to design the saliency detector under the assistance of prior knowledge obtained through both unsupervised and supervised learning processes. Experimental results on a public image benchmark show that the learned saliency detector outperforms 18 state-of-the-art approaches in predicting human fixations.
Keywords :
discrete Fourier transforms; frequency-domain analysis; image processing; object detection; security of data; unsupervised learning; discrete Fourier transform; distractor suppression; frequency domain; frequency spectrum; image saliency; saliency detector; supervised learning process; template-based contrast computation; unsupervised learning process; visual saliency modeling; Artificial intelligence; Computational modeling; Discrete Fourier transforms; Discrete cosine transforms; Fourier transforms; Frequency-domain analysis; Prediction models; Fourier transform; Image saliency; experimental study; fixation prediction; learning-based; spectral analysis;
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
DOI :
10.1109/TPAMI.2015.2424870