DocumentCode :
1241646
Title :
The Sum-over-Paths Covariance Kernel: A Novel Covariance Measure between Nodes of a Directed Graph
Author :
Mantrach, Amin ; Yen, Luh ; Callut, Jerome ; Francoisse, Kevin ; Shimbo, Masashi ; Saerens, Marco
Author_Institution :
IRIDIA, Univ. Libre de Bruxelles, Brussels, Belgium
Volume :
32
Issue :
6
fYear :
2010
fDate :
6/1/2010 12:00:00 AM
Firstpage :
1112
Lastpage :
1126
Abstract :
This work introduces a link-based covariance measure between the nodes of a weighted directed graph, where a cost is associated with each arc. To this end, a probability distribution on the (usually infinite) countable set of paths through the graph is defined by minimizing the total expected cost between all pairs of nodes while fixing the total relative entropy spread in the graph. This results in a Boltzmann distribution on the set of paths such that long (high-cost) paths occur with a low probability while short (low-cost) paths occur with a high probability. The sum-over-paths (SoP) covariance measure between nodes is then defined according to this probability distribution: two nodes are considered as highly correlated if they often co-occur together on the same - preferably short - paths. The resulting covariance matrix between nodes (say n nodes in total) is a Gram matrix and therefore defines a valid kernel on the graph. It is obtained by inverting an ntimes n matrix depending on the costs assigned to the arcs. In the same spirit, a betweenness score is also defined, measuring the expected number of times a node occurs on a path. The proposed measures could be used for various graph mining tasks such as computing betweenness centrality, semi-supervised classification of nodes, visualization, etc., as shown in Section 7.
Keywords :
covariance matrices; directed graphs; probability; Boltzmann distribution; Gram matrix; Nodes; centrality; covariance matrix; graph mining; link-based covariance measure; probability distribution; relative entropy; semi-supervised classification; sum-over-paths covariance kernel; visualization; weighted directed graph; Graph mining; betweenness measure; biased random walk; commute time distance; correlation measure; kernel on a graph; resistance distance; semi-supervised classification.; shortest path;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2009.78
Filename :
4815265
Link To Document :
بازگشت