• DocumentCode
    2173244
  • Title

    A multi-scale generative model for animate shapes and parts

  • Author

    Dubinskiy, Aleksandr ; Zhu, Song Chun

  • fYear
    2003
  • fDate
    13-16 Oct. 2003
  • Firstpage
    249
  • Abstract
    We present a multiscale generative model for representing animate shapes and extracting meaningful parts of objects. The model assumes that animate shapes (2D simple dosed curves) are formed by a linear superposition of a number of shape bases. These shape bases resemble the multiscale Gabor bases in image pyramid representation, are well localized in both spatial and frequency domains, and form an over-complete dictionary. This model is simpler than the popular B-spline representation since it does not engage a domain partition. Thus it eliminates the interference between adjacent B-spline bases, and becomes a true linear additive model. We pursue the bases by reconstructing the shape in a coarse-to-fine procedure through curve evolution. These shape bases are further organized in a tree-structure, where the bases in each subtree sum up to an intuitive part of the object. To build probabilistic model for a class of objects, we propose a Markov random field model at each level of the tree representation to account for the spatial relationship between bases. Thus the final model integrates a Markov tree (generative) model over scales and a Markov random field over space. We adopt EM-type algorithm for learning the meaningful parts for a shape class, and show some results on shape synthesis.
  • Keywords
    Markov processes; computer animation; curve fitting; feature extraction; image reconstruction; image representation; splines (mathematics); B-spline bases; EM-type algorithm; Markov random field model; Markov tree model; animate shape representation; feature extraction; image pyramid representation; multiscale Gabor bases; multiscale generative model; shape reconstruction; tree representation; Animation; Dictionaries; Frequency domain analysis; Image reconstruction; Interference elimination; Markov random fields; Polynomials; Principal component analysis; Shape; Spline;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision, 2003. Proceedings. Ninth IEEE International Conference on
  • Conference_Location
    Nice, France
  • Print_ISBN
    0-7695-1950-4
  • Type

    conf

  • DOI
    10.1109/ICCV.2003.1238350
  • Filename
    1238350