DocumentCode
2727171
Title
How Contents Influence Clustering Features in the Web
Author
Cheng, Xueqi ; Ren, Fuxin ; Cao, Xianbin ; Ma, Jing
fYear
2007
fDate
2-5 Nov. 2007
Firstpage
81
Lastpage
84
Abstract
In the World Wide Web, the contents of web documents play an important role in the evolution process because of their effect on linking preference. A majority of topological properties are content-related, and among them the clustering features are sensitive to the contents of Web documents. In this paper, we first observe the impact of content similarity on web links by introducing a metric called Linkage Probability. Then we investigate how contents influence the formation mechanism of the most basic cluster, the triangle, with a metric named Triangularization Probability. Experimental results indicate that content similarity has a positive effect on the process of cluster formation in the Web. Theoretical analysis predicts the influence of contents on the clustering features in the Web very well.
Keywords
Complex networks; Computer science; Couplings; Information retrieval; Joining processes; Web pages; Web sites
fLanguage
English
Publisher
IEEE
Conference_Titel
Web Intelligence, IEEE/WIC/ACM International Conference on
Conference_Location
Fremont, CA
Print_ISBN
978-0-7695-3026-0
Type
conf
DOI
10.1109/WI.2007.93
Filename
4427069