• DocumentCode
    2711009
  • Title

    Iterative Subgraph Mining for Principal Component Analysis

  • Author

    Saigo, Hiroto ; Tsuda, Koji

  • Author_Institution
    Max Planck Inst. for Inf., Saarbrucken
  • fYear
    2008
  • fDate
    15-19 Dec. 2008
  • Firstpage
    1007
  • Lastpage
    1012
  • Abstract
    Graph mining methods enumerate frequent subgraphs efficiently, but they are not necessarily good features for machine learning due to high correlation among features. Thus it makes sense to perform principal component analysis to reduce the dimensionality and create decorrelated features. We present a novel iterative mining algorithm that captures informative patterns corresponding to major entries of top principal components. It repeatedly calls weighted substructure mining where example weights are updated in each iteration. The Lanczos algorithm, a standard algorithm of eigen decomposition, is employed to update the weights. In experiments, our patterns are shown to approximate the principal components obtained by frequent mining.
  • Keywords
    data mining; data reduction; eigenvalues and eigenfunctions; graph theory; iterative methods; mathematics computing; principal component analysis; Lanczos algorithm; dimensionality reduction; eigen decomposition; iterative subgraph mining; machine learning; principal component analysis; weighted substructure mining; Approximation error; Covariance matrix; Cybernetics; Data mining; Indexing; Informatics; Iterative algorithms; Iterative methods; Least squares approximation; Principal component analysis; PCA; graph mining; lanczos algorithm; summarization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
  • Conference_Location
    Pisa
  • ISSN
    1550-4786
  • Print_ISBN
    978-0-7695-3502-9
  • Type

    conf

  • DOI
    10.1109/ICDM.2008.62
  • Filename
    4781216