• DocumentCode
    3428666
  • Title

    Semantically-Based Human Scanpath Estimation with HMMs

  • Author

    Huiying Liu ; Dong Xu ; Qingming Huang ; Wen Li ; Min Xu ; Lin, Shunjiang

  • Author_Institution
    Inst. for Infocomm Res., Singapore, Singapore
  • fYear
    2013
  • fDate
    1-8 Dec. 2013
  • Firstpage
    3232
  • Lastpage
    3239
  • Abstract
    We present a method for estimating human scan paths, which are sequences of gaze shifts that follow visual attention over an image. In this work, scan paths are modeled based on three principal factors that influence human attention, namely low-level feature saliency, spatial position, and semantic content. Low-level feature saliency is formulated as transition probabilities between different image regions based on feature differences. The effect of spatial position on gaze shifts is modeled as a Levy flight with the shifts following a 2D Cauchy distribution. To account for semantic content, we propose to use a Hidden Markov Model (HMM) with a Bag-of-Visual-Words descriptor of image regions. An HMM is well-suited for this purpose in that 1) the hidden states, obtained by unsupervised learning, can represent latent semantic concepts, 2) the prior distribution of the hidden states describes visual attraction to the semantic concepts, and 3) the transition probabilities represent human gaze shift patterns. The proposed method is applied to task-driven viewing processes. Experiments and analysis performed on human eye gaze data verify the effectiveness of this method.
  • Keywords
    feature extraction; gaze tracking; hidden Markov models; image representation; image sequences; unsupervised learning; 2D Cauchy distribution; HMM; Levy flight; bag-of-visual-word descriptor; feature difference; gaze shift sequences; hidden Markov model; hidden states; human attention; human eye gaze data; human gaze shift pattern representation; image region; latent semantic concept; low-level feature saliency; semantic content; semantically-based human scanpath estimation; spatial position; spatial position effect; task-driven viewing process; transition probability; unsupervised learning; visual attention; visual attraction; Estimation; Hidden Markov models; Image color analysis; Probability; Semantics; Training; Visualization; Attention; Gaze shift; Hidden Markov Model; Levy flight; Saliency;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision (ICCV), 2013 IEEE International Conference on
  • Conference_Location
    Sydney, NSW
  • ISSN
    1550-5499
  • Type

    conf

  • DOI
    10.1109/ICCV.2013.401
  • Filename
    6751513