DocumentCode :
808657
Title :
Nonlinear Dimensionality Reduction of Data Lying on the Multicluster Manifold
Author :
Meng, Deyu ; Leung, Yee ; Fung, Tung ; Xu, Zongben
Author_Institution :
Sch. of Electron. & Inf. Eng., Xi´´an Jiaotong Univ., Xi´´an
Volume :
38
Issue :
4
fYear :
2008
Firstpage :
1111
Lastpage :
1122
Abstract :
A new method, which is called decomposition-composition (D-C) method, is proposed for the nonlinear dimensionality reduction (NLDR) of data lying on the multicluster manifold. The main idea is first to decompose a given data set into clusters and independently calculate the low-dimensional embeddings of each cluster by the decomposition procedure. Based on the intercluster connections, the embeddings of all clusters are then composed into their proper positions and orientations by the composition procedure. Different from other NLDR methods for multicluster data, which consider associatively the intracluster and intercluster information, the D-C method capitalizes on the separate employment of the intracluster neighborhood structures and the intercluster topologies for effective dimensionality reduction. This, on one hand, isometrically preserves the rigid-body shapes of the clusters in the embedding process and, on the other hand, guarantees the proper locations and orientations of all clusters. The theoretical arguments are supported by a series of experiments performed on the synthetic and real-life data sets. In addition, the computational complexity of the proposed method is analyzed, and its efficiency is theoretically analyzed and experimentally demonstrated. Related strategies for automatic parameter selection are also examined.
Keywords :
computational complexity; data visualisation; pattern clustering; computational complexity; data decomposition; data visualization; decomposition-composition method; embedding process; intercluster connections; intercluster topologies; intracluster neighborhood structures; multicluster manifold; nonlinear dimensionality reduction; Computational complexity; Data visualization; Employment; Information processing; Multimedia databases; Shape; Spatial databases; Stochastic processes; Topology; Visual databases; Data visualization; decomposition–composition method (D–C method); isometric feature mapping; multicluster manifold; nonlinear dimensionality reduction (NLDR); Algorithms; Artificial Intelligence; Cluster Analysis; Computer Simulation; Models, Theoretical; Nonlinear Dynamics; Pattern Recognition, Automated;
fLanguage :
English
Journal_Title :
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
1083-4419
Type :
jour
DOI :
10.1109/TSMCB.2008.925663
Filename :
4567545
Link To Document :
بازگشت