• DocumentCode
    1440168
  • Title

    Reading Users´ Minds From Their Eyes: A Method for Implicit Image Annotation

  • Author

    Hajimirza, S. Navid ; Proulx, Michael J. ; Izquierdo, Ebroul

  • Author_Institution
    Multimedia & Vision Group, Queen Mary Univ. of London, London, UK
  • Volume
    14
  • Issue
    3
  • fYear
    2012
  • fDate
    6/1/2012 12:00:00 AM
  • Firstpage
    805
  • Lastpage
    815
  • Abstract
    This paper explores the possible solutions for image annotation and retrieval by implicitly monitoring user attention via eye-tracking. Features are extracted from the gaze trajectory of users examining sets of images to provide implicit information on the target template that guides visual attention. Our Gaze Inference System (GIS) is a fuzzy logic based framework that analyzes the gaze-movement features to assign a user interest level (UIL) from 0 to 1 to every image that appeared on the screen. Because some properties of the gaze features are unique for every user, our user adaptive framework builds a new processing system for every new user to achieve higher accuracy. The generated UILs can be used for image annotation purposes; however, the output of our system is not limited as it can be used also for retrieval or other scenarios. The developed framework produces promising and reliable UILs where approximately 53% of target images in the users´ minds can be identified by the machine with an error of less than 20% and the top 10% of them with no error. We show in this paper that the existing information in gaze patterns can be employed to improve the machine´s judgement of image content by assessment of human interest and attention to the objects inside virtual environments.
  • Keywords
    feature extraction; fuzzy logic; image retrieval; target tracking; UIL; eye-tracking; feature extraction; fuzzy logic based framework; gaze inference system; gaze patterns; human attention; human interest; image content; image retrieval; implicit image annotation; implicit information; machine judgement; target template; user adaptive framework; user attention monitoring; user interest level; user minds; virtual environments; visual attention; Accuracy; Feature extraction; Humans; Psychology; Training; Vectors; Visualization; Gaze tracking; human computer interaction; image annotation; image databases; image retrieval; implicit annotation; visual attention;
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2012.2186792
  • Filename
    6145688