DocumentCode :
1388339
Title :
On the Spectral Characterization and Scalable Mining of Network Communities
Author :
Bo Yang ; Liu, Jiming ; Feng, Jianfeng
Author_Institution :
Coll. of Comput. Sci. & Technol., Jilin Univ., Changchun, China
Volume :
24
Issue :
2
fYear :
2012
Firstpage :
326
Lastpage :
337
Abstract :
Network communities refer to groups of vertices within which their connecting links are dense but between which they are sparse. A network community mining problem (or NCMP for short) is concerned with the problem of finding all such communities from a given network. A wide variety of applications can be formulated as NCMPs, ranging from social and/or biological network analysis to web mining and searching. So far, many algorithms addressing NCMPs have been developed and most of them fall into the categories of either optimization based or heuristic methods. Distinct from the existing studies, the work presented in this paper explores the notion of network communities and their properties based on the dynamics of a stochastic model naturally introduced. In the paper, a relationship between the hierarchical community structure of a network and the local mixing properties of such a stochastic model has been established with the large-deviation theory. Topological information regarding to the community structures hidden in networks can be inferred from their spectral signatures. Based on the above-mentioned relationship, this work proposes a general framework for characterizing, analyzing, and mining network communities. Utilizing the two basic properties of metastability, i.e., being locally uniform and temporarily fixed, an efficient implementation of the framework, called the LM algorithm, has been developed that can scalably mine communities hidden in large-scale networks. The effectiveness and efficiency of the LM algorithm have been theoretically analyzed as well as experimentally validated.
Keywords :
Internet; data mining; stochastic processes; Web mining; biological network analysis; heuristic method; hierarchical community structure; large-deviation theory; large-scale network; metastability; network community mining; network community mining problem; scalable mining; spectral characterization; stochastic model; topological information; Eigenvalues and eigenfunctions; Heuristic algorithms; Markov processes; Optimization; Social network services; Markov chain; Social network; community structure; large-deviation theory.; local mixing;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2010.233
Filename :
5645622
Link To Document :
بازگشت