• DocumentCode
    14813
  • Title

    A Visual-Attention Model Using Earth Mover´s Distance-Based Saliency Measurement and Nonlinear Feature Combination

  • Author

    Yuewei Lin ; Yuan Yan Tang ; Bin Fang ; Zhaowei Shang ; Yonghui Huang ; Song Wang

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of South Carolina, Columbia, SC, USA
  • Volume
    35
  • Issue
    2
  • fYear
    2013
  • fDate
    Feb. 2013
  • Firstpage
    314
  • Lastpage
    328
  • Abstract
    This paper introduces a new computational visual-attention model for static and dynamic saliency maps. First, we use the Earth Mover´s Distance (EMD) to measure the center-surround difference in the receptive field, instead of using the Difference-of-Gaussian filter that is widely used in many previous visual-attention models. Second, we propose to take two steps of biologically inspired nonlinear operations for combining different features: combining subsets of basic features into a set of super features using the Lm-norm and then combining the super features using the Winner-Take-All mechanism. Third, we extend the proposed model to construct dynamic saliency maps from videos by using EMD for computing the center-surround difference in the spatiotemporal receptive field. We evaluate the performance of the proposed model on both static image data and video data. Comparison results show that the proposed model outperforms several existing models under a unified evaluation setting.
  • Keywords
    computer vision; feature extraction; video signal processing; Difference-of-Gaussian filter; biologically inspired nonlinear operation; center-surround difference measurement; computational visual-attention model; dynamic saliency map; earth mover´s distance-based saliency measurement; nonlinear feature combination; spatiotemporal receptive field; static image data; static saliency map; super features; video data; winner-take-all mechanism; Biological system modeling; Computational modeling; Earth; Educational institutions; Histograms; Humans; Visualization; Visual attention; dynamic saliency maps; earth mover´s distance (EMD); saliency maps; spatiotemporal receptive field (STRF); Algorithms; Artificial Intelligence; Attention; Biomimetics; Computer Simulation; Fixation, Ocular; Humans; Image Interpretation, Computer-Assisted; Models, Biological; Nonlinear Dynamics; Pattern Recognition, Automated; Pattern Recognition, Visual; Reproducibility of Results; Sensitivity and Specificity;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2012.119
  • Filename
    6205759