• DocumentCode
    46823
  • Title

    Efficient and Robust Pupil Size and Blink Estimation From Near-Field Video Sequences for Human–Machine Interaction

  • Author

    Siyuan Chen ; Epps, Julien

  • Author_Institution
    Sch. of Electr. Eng. & Telecommun., Univ. of New South Wales, Sydney, NSW, Australia
  • Volume
    44
  • Issue
    12
  • fYear
    2014
  • fDate
    Dec. 2014
  • Firstpage
    2356
  • Lastpage
    2367
  • Abstract
    Monitoring pupil and blink dynamics has applications in cognitive load measurement during human-machine interaction. However, accurate, efficient, and robust pupil size and blink estimation pose significant challenges to the efficacy of real-time applications due to the variability of eye images, hence to date, require manual intervention for fine tuning of parameters. In this paper, a novel self-tuning threshold method, which is applicable to any infrared-illuminated eye images without a tuning parameter, is proposed for segmenting the pupil from the background images recorded by a low cost webcam placed near the eye. A convex hull and a dual-ellipse fitting method are also proposed to select pupil boundary points and to detect the eyelid occlusion state. Experimental results on a realistic video dataset show that the measurement accuracy using the proposed methods is higher than that of widely used manually tuned parameter methods or fixed parameter methods. Importantly, it demonstrates convenience and robustness for an accurate and fast estimate of eye activity in the presence of variations due to different users, task types, load, and environments. Cognitive load measurement in human-machine interaction can benefit from this computationally efficient implementation without requiring a threshold calibration beforehand. Thus, one can envisage a mini IR camera embedded in a lightweight glasses frame, like Google Glass, for convenient applications of real-time adaptive aiding and task management in the future.
  • Keywords
    image segmentation; image sequences; man-machine systems; real-time systems; user interfaces; video databases; video signal processing; Google Glass; background images; blink dynamics; blink estimation pose; cognitive load measurement; computationally efficient implementation; convex hull; dual-ellipse fitting method; eye activity; eyelid occlusion state; fixed parameter methods; human-machine interaction; infrared-illuminated eye images; lightweight glasses frame; low cost webcam; manually tuned parameter methods; measurement accuracy; mini IR camera; near-field video sequences; pupil boundary points; pupil segmentation; real-time adaptive aiding; realistic video dataset; robust pupil size; self-tuning threshold method; task management; task types; Cameras; Eyelids; Histograms; Image segmentation; Real-time systems; Robustness; Shape; Biometrics; cognitive informatics; human factors; image segmentation; pervasive computing;
  • fLanguage
    English
  • Journal_Title
    Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2168-2267
  • Type

    jour

  • DOI
    10.1109/TCYB.2014.2306916
  • Filename
    6777305