• DocumentCode
    1485032
  • Title

    Statistical Computations on Grassmann and Stiefel Manifolds for Image and Video-Based Recognition

  • Author

    Turaga, Pavan ; Veeraraghavan, Ashok ; Srivastava, Anurag ; Chellappa, Rama

  • Author_Institution
    Center for Autom. Res., Univ. of Maryland, Coll. Park, College Park, MD, USA
  • Volume
    33
  • Issue
    11
  • fYear
    2011
  • Firstpage
    2273
  • Lastpage
    2286
  • Abstract
    In this paper, we examine image and video-based recognition applications where the underlying models have a special structure-the linear subspace structure. We discuss how commonly used parametric models for videos and image sets can be described using the unified framework of Grassmann and Stiefel manifolds. We first show that the parameters of linear dynamic models are finite-dimensional linear subspaces of appropriate dimensions. Unordered image sets as samples from a finite-dimensional linear subspace naturally fall under this framework. We show that an inference over subspaces can be naturally cast as an inference problem on the Grassmann manifold. To perform recognition using subspace-based models, we need tools from the Riemannian geometry of the Grassmann manifold. This involves a study of the geometric properties of the space, appropriate definitions of Riemannian metrics, and definition of geodesics. Further, we derive statistical modeling of inter and intraclass variations that respect the geometry of the space. We apply techniques such as intrinsic and extrinsic statistics to enable maximum-likelihood classification. We also provide algorithms for unsupervised clustering derived from the geometry of the manifold. Finally, we demonstrate the improved performance of these methods in a wide variety of vision applications such as activity recognition, video-based face recognition, object recognition from image sets, and activity-based video clustering.
  • Keywords
    computational geometry; image recognition; maximum likelihood estimation; video signal processing; Grassmann Manifolds; Riemannian geometry; Riemannian metrics; Stiefel Manifolds; activity based video clustering; activity recognition; finite dimensional linear subspaces; geometric properties; image recognition; linear dynamic models; linear subspace structure; maximum likelihood classification; object recognition; statistical computations; unsupervised clustering; video based face recognition; video based recognition; Computational modeling; Data models; Face recognition; Geometry; Humans; Manifolds; Shape; Grassmann.; Image and video models; Stiefel; feature representation; manifolds; statistical models;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2011.52
  • Filename
    5740915