Title :
An Approach of Community Detecting Based on Block Level Link Analysis
Author :
Hui, Ding ; Yang, Nan
Author_Institution :
Inf. Sch., Renmin Univ. of China, Beijing, China
Abstract :
With the explosive increment of the information in the Web, how to efficiently understand collective behaviors and emerging phenomenon in the WWW is becoming a serious problem. Web community, which can be recognized as a set of web units, usually the pages, based on a common topic, is a significant structure in the Web. An efficient way to detect a community corresponding to a specific topic can greatly help users to obtain useful information. However, existing community detecting algorithms are all based on pages as units, the topic drift phenomenon becomes the inherent problem of each existing community detecting algorithm since the Web pages always have multiple topics in content. This paper puts forward a new community detecting algorithm based on blocks as units, which combines the page dividing algorithm with the primary community detecting algorithm, overcoming the defect in the existing community detecting algorithms. The communities obtained by the new algorithm have much more definite topic in content, which is more meaningful for further data processing such as information extraction and so on.
Keywords :
Internet; graph theory; Web community; Web pages; World Wide Web; block level link analysis; data processing; information extraction; page bipartite graph; page dividing algorithm; primary community detecting algorithm; topic drift phenomenon; Algorithm design and analysis; Bipartite graph; Data mining; Data processing; Explosives; Information analysis; Information systems; Oceans; Web pages; World Wide Web; block; community; page dividing; web;
Conference_Titel :
Web Information Systems and Applications Conference, 2009. WISA 2009. Sixth
Conference_Location :
Xuzhou, Jiangsu
Print_ISBN :
978-0-7695-3874-7
DOI :
10.1109/WISA.2009.40