DocumentCode :
1432764
Title :
A unified model for probabilistic principal surfaces
Author :
Chang, Kui-yu ; Ghosh, Joydeep
Author_Institution :
Interwoven Inc., Sunnyvale, CA, USA
Volume :
23
Issue :
1
fYear :
2001
fDate :
1/1/2001 12:00:00 AM
Firstpage :
22
Lastpage :
41
Abstract :
Principal curves and surfaces are nonlinear generalizations of principal components and subspaces, respectively. They can provide insightful summary of high-dimensional data not typically attainable by classical linear methods. Solutions to several problems, such as proof of existence and convergence, faced by the original principal curve formulation have been proposed in the past few years. Nevertheless, these solutions are not generally extensible to principal surfaces, the mere computation of which presents a formidable obstacle. Consequently, relatively few studies of principal surfaces are available. We previously (2000) proposed the probabilistic principal surface (PPS) to address a number of issues associated with current principal surface algorithms. PPS uses a manifold oriented covariance noise model, based on the generative topographical mapping (GTM), which can be viewed as a parametric formulation of Kohonen´s self-organizing map. Building on the PPS, we introduce a unified covariance model that implements PPS (0<α<1), GTM (α=1), and the manifold-aligned GTM (α>1) by varying the clamping parameter α. Then, we comprehensively evaluate the empirical performance of PPS, GTM, and the manifold-aligned GTM on three popular benchmark data sets. It is shown in two different comparisons that the PPS outperforms the GTM under identical parameter settings. Convergence of the PPS is found to be identical to that of the GTM and the computational overhead incurred by the PPS decreases to 40 percent or less for more complex manifolds. These results show that the generalized PPS provides a flexible and effective way of obtaining principal surfaces
Keywords :
convergence; probability; self-organising feature maps; Kohonen´s self-organizing map; generative topographical mapping; high-dimensional data; manifold oriented covariance noise model; parametric formulation; principal components; probabilistic principal surfaces; subspaces; unified covariance model; Clamps; Convergence; Data mining; Data visualization; Feature extraction; Independent component analysis; Linear discriminant analysis; Noise generators; Principal component analysis; Surface topography;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/34.899944
Filename :
899944
Link To Document :
بازگشت