DocumentCode :
3652130
Title :
Statistical Selection of Congruent Subspaces for Mining Attributed Graphs
Author :
Patricia Iglesias Sánchez;Emmanuel Müller;Fabian Laforet;Fabian Keller;Klemens Böhm
Author_Institution :
Karlsruhe Inst. of Technol. (KIT), Karlsruhe, Germany
fYear :
2013
Firstpage :
647
Lastpage :
656
Abstract :
Current mining algorithms for attributed graphs exploit dependencies between attribute information and edge structure, referred to as homophily. However, techniques fail if this assumption does not hold for the full attribute space. In multivariate spaces, some attributes have high dependency with the graph structure while others do not show any dependency. Hence, it is important to select congruent subspaces (i.e., subsets of the node attributes) showing dependencies with the graph structure. In this work, we propose a method for the statistical selection of such congruent subspaces. More specifically, we define a measure which assesses the degree of congruence between a set of attributes and the entire graph. We use it as the core of a statistical test, which congruent subspaces must pass. To illustrate its applicability to common graph mining tasks and in order to evaluate our selection scheme, we apply it to community outlier detection. Our selection of congruent subspaces enhances outlier detection by measuring outlier ness scores in selected subspaces only. Experiments on attributed graphs show that our approach outperforms traditional full space approaches and gives way to better outlier detection.
Keywords :
"Communities","Data mining","Estimation","Social network services","Vectors","Image edge detection","Monte Carlo methods"
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2013 IEEE 13th International Conference on
ISSN :
1550-4786
Type :
conf
DOI :
10.1109/ICDM.2013.88
Filename :
6729549
Link To Document :
بازگشت