Title :
Scalable Seed Expansion for Identifying Web Communities
Author :
Han, Min ; Shen, Hong ; Zhang, Xianchao
Author_Institution :
Fac. of Electron. Inf. & Electr. Eng., Dalian Univ. of Technol., Dalian, China
Abstract :
We study the problem of identifying Web communities around some seed vertex. In this work, we propose a fast graph algorithm to expand Web communities in a scalable style. Given a seed vertex, our algorithm computes approximate personalized PageRank vectors with better and better approximations, and finds the smallest conductance sets on these vectors as candidate communities in nearly-linear time. At the end, it returns the candidate community with the smallest conductance as the result community. We also define local community profile (LCP) to investigate structural and statistical properties of Web communities in a local range. Theoretical analysis and primary experiments both show the efficiency of the proposed algorithm and the quality of the results.
Keywords :
social networking (online); statistical analysis; Web community identification; local community profile; personalized PageRank vectors; scalable seed expansion; seed vertex; statistical properties; structural properties; Algorithm design and analysis; Approximation algorithms; Approximation methods; Clustering algorithms; Communities; Educational institutions; Vectors;
Conference_Titel :
Parallel Architectures, Algorithms and Programming (PAAP), 2011 Fourth International Symposium on
Conference_Location :
Tianjin
Print_ISBN :
978-1-4577-1808-3
DOI :
10.1109/PAAP.2011.64