Title :
A multi-scale generative model for animate shapes and parts
Author :
Dubinskiy, Aleksandr ; Zhu, Song Chun
Abstract :
We present a multiscale generative model for representing animate shapes and extracting meaningful parts of objects. The model assumes that animate shapes (2D simple dosed curves) are formed by a linear superposition of a number of shape bases. These shape bases resemble the multiscale Gabor bases in image pyramid representation, are well localized in both spatial and frequency domains, and form an over-complete dictionary. This model is simpler than the popular B-spline representation since it does not engage a domain partition. Thus it eliminates the interference between adjacent B-spline bases, and becomes a true linear additive model. We pursue the bases by reconstructing the shape in a coarse-to-fine procedure through curve evolution. These shape bases are further organized in a tree-structure, where the bases in each subtree sum up to an intuitive part of the object. To build probabilistic model for a class of objects, we propose a Markov random field model at each level of the tree representation to account for the spatial relationship between bases. Thus the final model integrates a Markov tree (generative) model over scales and a Markov random field over space. We adopt EM-type algorithm for learning the meaningful parts for a shape class, and show some results on shape synthesis.
Keywords :
Markov processes; computer animation; curve fitting; feature extraction; image reconstruction; image representation; splines (mathematics); B-spline bases; EM-type algorithm; Markov random field model; Markov tree model; animate shape representation; feature extraction; image pyramid representation; multiscale Gabor bases; multiscale generative model; shape reconstruction; tree representation; Animation; Dictionaries; Frequency domain analysis; Image reconstruction; Interference elimination; Markov random fields; Polynomials; Principal component analysis; Shape; Spline;
Conference_Titel :
Computer Vision, 2003. Proceedings. Ninth IEEE International Conference on
Conference_Location :
Nice, France
Print_ISBN :
0-7695-1950-4
DOI :
10.1109/ICCV.2003.1238350