• DocumentCode
    3082961
  • Title

    Estimating mixture models of images and inferring spatial transformations using the EM algorithm

  • Author

    Frey, Brendan J. ; Jojic, Nebojsa

  • Author_Institution
    Beckman Inst. for Adv. Sci. & Technol., Illinois Univ., Urbana, IL, USA
  • Volume
    1
  • fYear
    1999
  • fDate
    1999
  • Abstract
    Mixture modeling and clustering algorithms are effective, simple ways to represent images using a set of data centers. However, in situations where the images include background clutter and transformations such as translation, rotation, shearing and warping, these methods extract data centers that include clutter and represent different transformations of essentially the same data. Taking face images as an example, it would be more useful for the different clusters to represent different poses and expressions, instead of cluttered versions of different translations, scales and rotations. By including clutter and transformation as unobserved, latent variables in a mixture model, we obtain a new “transformed mixture of Gaussians”, which is invariant to a specified set of transformations. We show how a linear-time EM algorithm can be used to fit this model by jointly estimating a mixture model for the data and inferring the transformation for each image. We show that this algorithm can jointly align images of a human head and learn different poses. We also find that the algorithm performs better than k-nearest neighbors and mixtures of Gaussians on handwritten digit recognition
  • Keywords
    clutter; computer vision; handwritten character recognition; motion estimation; Gaussians; background clutter; data centers; handwritten digit recognition; images estimation; k-nearest neighbors; linear-time EM algorithm; mixture models; rotation; shearing; spatial transformations; translation; warping; Clustering algorithms; Coherence; Gaussian noise; Gaussian processes; Head; Humans; Image recognition; Pixel; Shearing; Video sequences;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition, 1999. IEEE Computer Society Conference on.
  • Conference_Location
    Fort Collins, CO
  • ISSN
    1063-6919
  • Print_ISBN
    0-7695-0149-4
  • Type

    conf

  • DOI
    10.1109/CVPR.1999.786972
  • Filename
    786972