• DocumentCode
    3421743
  • Title

    From Where and How to What We See

  • Author

    Karthikeyan, S. ; Jagadeesh, Vignesh ; Shenoy, Renuka ; Ecksteinz, Miguel ; Manjunath, B.S.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of California Santa Barbara, Santa Barbara, CA, USA
  • fYear
    2013
  • fDate
    1-8 Dec. 2013
  • Firstpage
    625
  • Lastpage
    632
  • Abstract
    Eye movement studies have confirmed that overt attention is highly biased towards faces and text regions in images. In this paper we explore a novel problem of predicting face and text regions in images using eye tracking data from multiple subjects. The problem is challenging as we aim to predict the semantics (face/text/background) only from eye tracking data without utilizing any image information. The proposed algorithm spatially clusters eye tracking data obtained in an image into different coherent groups and subsequently models the likelihood of the clusters containing faces and text using a fully connected Markov Random Field (MRF). Given the eye tracking data from a test image, it predicts potential face/head (humans, dogs and cats) and text locations reliably. Furthermore, the approach can be used to select regions of interest for further analysis by object detectors for faces and text. The hybrid eye position/object detector approach achieves better detection performance and reduced computation time compared to using only the object detection algorithm. We also present a new eye tracking dataset on 300 images selected from ICDAR, Street-view, Flickr and Oxford-IIIT Pet Dataset from 15 subjects.
  • Keywords
    Markov processes; face recognition; gaze tracking; text detection; Flickr dataset; ICDAR dataset; Markov random field; Oxford-IIIT Pet Dataset; Street-view dataset; computation time; detection performance; eye movement; eye tracking dataset; face regions; hybrid eye position-object detector approach; image information; object detectors; semantics prediction; spatially clusters eye tracking data; test image; text locations; text regions; Cats; Clustering algorithms; Dogs; Image color analysis; Reliability; Semantics; Tracking; Dog and Cat Detection; Eye Tracking; Text Detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision (ICCV), 2013 IEEE International Conference on
  • Conference_Location
    Sydney, NSW
  • ISSN
    1550-5499
  • Type

    conf

  • DOI
    10.1109/ICCV.2013.83
  • Filename
    6751187