• DocumentCode
    2603834
  • Title

    Soft biometric trait classification from real-world face videos conditioned on head pose estimation

  • Author

    Demirkus, Meltem ; Precup, Doina ; Clark, James J. ; Arbel, Tal

  • Author_Institution
    Dept. of Electr. & Comput. Eng., McGill Univ., Montreal, QC, Canada
  • fYear
    2012
  • fDate
    16-21 June 2012
  • Firstpage
    130
  • Lastpage
    137
  • Abstract
    Recently, soft biometric trait classification has been receiving more attention in the computer vision community due to its wide range of possible application areas. Most approaches in the literature have focused on trait classification in controlled environments, due to the challenges presented by real-world environments, i.e. arbitrary facial expressions, arbitrary partial occlusions, arbitrary and nonuniform illumination conditions and arbitrary background clutter. In recent years, trait classification has started to be applied to real-world environments, with some success. However, the focus has been on estimation from single images or video frames, without leveraging the temporal information available in the entire video sequence. In addition, a fixed set of features are usually used for trait classification without any consideration of possible changes in the facial features due to head pose changes. In this paper, we propose a temporal, probabilistic framework first to robustly estimate continuous head pose angles from real-world videos, and then use this pose estimate to decide on the appropriate set of frames and features to use in a temporal fusion scheme for soft biometric trait classification. Experiments performed on large, real-world video sequences show that our head pose estimator outperforms the current state-of-the-art head pose approaches (by up to 51%), whereas our head pose conditioned biometric trait classifier (for the case of gender classification) outperforms the current state-of-the-art approaches (by up to 31%).
  • Keywords
    computer vision; face recognition; image classification; image sequences; pose estimation; probability; video signal processing; arbitrary illumination; arbitrary partial occlusions; computer vision community; controlled environments; facial expressions; head pose angles; head pose estimation; image frames; nonuniform illumination; probabilistic framework; real-world environments; real-world face videos; real-world videos; soft biometric trait classification; temporal fusion; temporal information; trait classification; video frames; video sequence; video sequences; Estimation; Face; Feature extraction; Training; Video sequences; Videos;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition Workshops (CVPRW), 2012 IEEE Computer Society Conference on
  • Conference_Location
    Providence, RI
  • ISSN
    2160-7508
  • Print_ISBN
    978-1-4673-1611-8
  • Electronic_ISBN
    2160-7508
  • Type

    conf

  • DOI
    10.1109/CVPRW.2012.6239227
  • Filename
    6239227