• DocumentCode
    808657
  • Title

    Nonlinear Dimensionality Reduction of Data Lying on the Multicluster Manifold

  • Author

    Meng, Deyu ; Leung, Yee ; Fung, Tung ; Xu, Zongben

  • Author_Institution
    Sch. of Electron. & Inf. Eng., Xi´´an Jiaotong Univ., Xi´´an
  • Volume
    38
  • Issue
    4
  • fYear
    2008
  • Firstpage
    1111
  • Lastpage
    1122
  • Abstract
    A new method, which is called decomposition-composition (D-C) method, is proposed for the nonlinear dimensionality reduction (NLDR) of data lying on the multicluster manifold. The main idea is first to decompose a given data set into clusters and independently calculate the low-dimensional embeddings of each cluster by the decomposition procedure. Based on the intercluster connections, the embeddings of all clusters are then composed into their proper positions and orientations by the composition procedure. Different from other NLDR methods for multicluster data, which consider associatively the intracluster and intercluster information, the D-C method capitalizes on the separate employment of the intracluster neighborhood structures and the intercluster topologies for effective dimensionality reduction. This, on one hand, isometrically preserves the rigid-body shapes of the clusters in the embedding process and, on the other hand, guarantees the proper locations and orientations of all clusters. The theoretical arguments are supported by a series of experiments performed on the synthetic and real-life data sets. In addition, the computational complexity of the proposed method is analyzed, and its efficiency is theoretically analyzed and experimentally demonstrated. Related strategies for automatic parameter selection are also examined.
  • Keywords
    computational complexity; data visualisation; pattern clustering; computational complexity; data decomposition; data visualization; decomposition-composition method; embedding process; intercluster connections; intercluster topologies; intracluster neighborhood structures; multicluster manifold; nonlinear dimensionality reduction; Computational complexity; Data visualization; Employment; Information processing; Multimedia databases; Shape; Spatial databases; Stochastic processes; Topology; Visual databases; Data visualization; decomposition–composition method (D–C method); isometric feature mapping; multicluster manifold; nonlinear dimensionality reduction (NLDR); Algorithms; Artificial Intelligence; Cluster Analysis; Computer Simulation; Models, Theoretical; Nonlinear Dynamics; Pattern Recognition, Automated;
  • fLanguage
    English
  • Journal_Title
    Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1083-4419
  • Type

    jour

  • DOI
    10.1109/TSMCB.2008.925663
  • Filename
    4567545