• DocumentCode
    1466702
  • Title

    3-D Head Tracking via Invariant Keypoint Learning

  • Author

    Wang, Haibo ; Davoine, Franck ; Lepetit, Vincent ; Chaillou, Christophe ; Pan, Chunhong

  • Author_Institution
    Shandong Univ., Jinan, China
  • Volume
    22
  • Issue
    8
  • fYear
    2012
  • Firstpage
    1113
  • Lastpage
    1126
  • Abstract
    Keypoint matching is a standard tool to solve the correspondence problem in vision applications. However, in 3-D face tracking, this approach is often deficient because the human face complexities, together with its rich viewpoint, nonrigid expression, and lighting variations in typical applications, can cause many variations impossible to handle by existing keypoint detectors and descriptors. In this paper, we propose a new approach to tailor keypoint matching to track the 3-D pose of the user head in a video stream. The core idea is to learn keypoints that are explicitly invariant to these challenging transformations. First, we select keypoints that are stable under randomly drawn small viewpoints, nonrigid deformations, and illumination changes. Then, we treat keypoint descriptor learning at different large angles as an incremental scheme to learn discriminative descriptors. At matching time, to reduce the ratio of outlier correspondences, we use second-order color information to prune keypoints unlikely to lie on the face. Moreover, we integrate optical flow correspondences in an adaptive way to remove motion jitter efficiently. Extensive experiments show that the proposed approach can lead to fast, robust, and accurate 3-D head tracking results even under very challenging scenarios.
  • Keywords
    computer vision; image colour analysis; image matching; image motion analysis; image sequences; jitter; learning (artificial intelligence); lighting; object tracking; pose estimation; video signal processing; 3D face tracking; 3D head tracking; 3D pose tracking; discriminative descriptor learning; illumination changes; incremental learning; invariant keypoint learning; keypoint descriptor learning; keypoint detector; keypoint matching; lighting variations; matching time; motion jitter removal; nonrigid deformations; nonrigid expression; optical flow correspondences; outlier correspondences ratio reduction; second-order color information; video stream; vision applications; Face; Image color analysis; Lighting; Nonlinear distortion; Optical imaging; Three dimensional displays; 3-D head tracking; keypoint-based tracking; pose estimation;
  • fLanguage
    English
  • Journal_Title
    Circuits and Systems for Video Technology, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1051-8215
  • Type

    jour

  • DOI
    10.1109/TCSVT.2012.2190474
  • Filename
    6166872