DocumentCode :
2711009
Title :
Iterative Subgraph Mining for Principal Component Analysis
Author :
Saigo, Hiroto ; Tsuda, Koji
Author_Institution :
Max Planck Inst. for Inf., Saarbrucken
fYear :
2008
fDate :
15-19 Dec. 2008
Firstpage :
1007
Lastpage :
1012
Abstract :
Graph mining methods enumerate frequent subgraphs efficiently, but they are not necessarily good features for machine learning due to high correlation among features. Thus it makes sense to perform principal component analysis to reduce the dimensionality and create decorrelated features. We present a novel iterative mining algorithm that captures informative patterns corresponding to major entries of top principal components. It repeatedly calls weighted substructure mining where example weights are updated in each iteration. The Lanczos algorithm, a standard algorithm of eigen decomposition, is employed to update the weights. In experiments, our patterns are shown to approximate the principal components obtained by frequent mining.
Keywords :
data mining; data reduction; eigenvalues and eigenfunctions; graph theory; iterative methods; mathematics computing; principal component analysis; Lanczos algorithm; dimensionality reduction; eigen decomposition; iterative subgraph mining; machine learning; principal component analysis; weighted substructure mining; Approximation error; Covariance matrix; Cybernetics; Data mining; Indexing; Informatics; Iterative algorithms; Iterative methods; Least squares approximation; Principal component analysis; PCA; graph mining; lanczos algorithm; summarization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
Conference_Location :
Pisa
ISSN :
1550-4786
Print_ISBN :
978-0-7695-3502-9
Type :
conf
DOI :
10.1109/ICDM.2008.62
Filename :
4781216
Link To Document :
بازگشت