DocumentCode
2711009
Title
Iterative Subgraph Mining for Principal Component Analysis
Author
Saigo, Hiroto ; Tsuda, Koji
Author_Institution
Max Planck Inst. for Inf., Saarbrucken
fYear
2008
fDate
15-19 Dec. 2008
Firstpage
1007
Lastpage
1012
Abstract
Graph mining methods enumerate frequent subgraphs efficiently, but they are not necessarily good features for machine learning due to high correlation among features. Thus it makes sense to perform principal component analysis to reduce the dimensionality and create decorrelated features. We present a novel iterative mining algorithm that captures informative patterns corresponding to major entries of top principal components. It repeatedly calls weighted substructure mining where example weights are updated in each iteration. The Lanczos algorithm, a standard algorithm of eigen decomposition, is employed to update the weights. In experiments, our patterns are shown to approximate the principal components obtained by frequent mining.
Keywords
data mining; data reduction; eigenvalues and eigenfunctions; graph theory; iterative methods; mathematics computing; principal component analysis; Lanczos algorithm; dimensionality reduction; eigen decomposition; iterative subgraph mining; machine learning; principal component analysis; weighted substructure mining; Approximation error; Covariance matrix; Cybernetics; Data mining; Indexing; Informatics; Iterative algorithms; Iterative methods; Least squares approximation; Principal component analysis; PCA; graph mining; lanczos algorithm; summarization;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
Conference_Location
Pisa
ISSN
1550-4786
Print_ISBN
978-0-7695-3502-9
Type
conf
DOI
10.1109/ICDM.2008.62
Filename
4781216
Link To Document