• DocumentCode
    1521383
  • Title

    Similarity Preserving Principal Curve: An Optimal 1-D Feature Extractor for Data Representation

  • Author

    Sun, Mingming ; Yang, Jian ; Liu, ChuanCai ; Yang, Jingyu

  • Author_Institution
    Dept. of Comput. Sci., Nanjing Univ. of Sci. & Technol., Nanjing, China
  • Volume
    21
  • Issue
    9
  • fYear
    2010
  • Firstpage
    1445
  • Lastpage
    1456
  • Abstract
    This paper discusses the problem of what kind of learning model is suitable for the tasks of feature extraction for data representation and suggests two evaluation criteria for nonlinear feature extractors: reconstruction error minimization and similarity preservation. Based on the suggested evaluation criteria, a new type of principal curve-similarity preserving principal curve (SPPC) is proposed. SPPCs minimize the reconstruction error under the condition that the similarity between similar samples are preserved in the extracted features, thus giving researchers effective and reliable cognition of the inner structure of data sets. The existence and properties of SPPCs are analyzed; a practical learning algorithm is proposed and high dimensional extensions of SPPCs are also discussed. Experimental results show the virtues of SPPCs in preserving inner structures of data sets and discovering manifolds with high nonlinearity.
  • Keywords
    curve fitting; feature extraction; image reconstruction; minimisation; SPPC; data representation; feature extraction; learning model; nonlinear feature extractor; optimal 1D feature extractor; reconstruction error minimization; similarity preserving principal curve; Algorithm design and analysis; Cognition; Computer science; Data mining; Feature extraction; Image reconstruction; Manifolds; Pattern recognition; Principal component analysis; Sun; Feature extraction; manifold learning; principal curve; reconstruction error; similarity preservation; Algorithms; Artificial Intelligence; Computer Simulation; Neural Networks (Computer); Pattern Recognition, Automated; Signal Processing, Computer-Assisted;
  • fLanguage
    English
  • Journal_Title
    Neural Networks, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9227
  • Type

    jour

  • DOI
    10.1109/TNN.2010.2048577
  • Filename
    5491192