• DocumentCode
    615159
  • Title

    Online learning and fusion of orientation appearance models for robust rigid object tracking

  • Author

    Marras, Ioannis ; Medina, Joan Alabort ; Tzimiropoulos, Georgios ; Zafeiriou, Stefanos ; Pantic, Maja

  • Author_Institution
    Dept. of Comput., Imperial Coll. London, London, UK
  • fYear
    2013
  • fDate
    22-26 April 2013
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    We present a robust framework for learning and fusing different modalities for rigid object tracking. Our method fuses data obtained from a standard visual camera and dense depth maps obtained by low-cost consumer depths cameras such as the Kinect. To combine these two completely different modalities, we propose to use features that do not depend on the data representation: angles. More specifically, our method combines image gradient orientations as extracted from intensity images with the directions of surface normals computed from dense depth fields provided by the Kinect. To incorporate these features in a learning framework, we use a robust kernel based on the Euler representation of angles. This kernel enables us to cope with gross measurement errors, missing data as well as typical problems in visual tracking such as illumination changes and occlusions. Additionally, the employed kernel can be efficiently implemented online. Finally, we propose to capture the correlations between the obtained orientation appearance models using a fusion approach motivated by the original AAM. Thus the proposed learning and fusing framework is robust, exact, computationally efficient and does not require off-line training. By combining the proposed models with a particle filter, the proposed tracking framework achieved robust performance in very difficult tracking scenarios including extreme pose variations.
  • Keywords
    gradient methods; image fusion; learning (artificial intelligence); particle filtering (numerical methods); pose estimation; target tracking; AAM; Euler representation; Kinect; dense depth maps; extreme pose variations; fusion approach; gross measurement errors; illumination changes; illumination occlusions; image gradient orientations; intensity images; low-cost consumer depths cameras; online learning; orientation appearance models; particle filter; robust kernel; robust rigid object tracking; standard visual camera; surface normals; Computational modeling; Hidden Markov models; Principal component analysis; Robustness; Shape; Solid modeling; Tracking;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Face and Gesture Recognition (FG), 2013 10th IEEE International Conference and Workshops on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4673-5545-2
  • Electronic_ISBN
    978-1-4673-5544-5
  • Type

    conf

  • DOI
    10.1109/FG.2013.6553798
  • Filename
    6553798