• DocumentCode
    2727171
  • Title

    How Contents Influence Clustering Features in the Web

  • Author

    Cheng, Xueqi ; Ren, Fuxin ; Cao, Xianbin ; Ma, Jing

  • fYear
    2007
  • fDate
    2-5 Nov. 2007
  • Firstpage
    81
  • Lastpage
    84
  • Abstract
    In the World Wide Web, the contents of web documents play an important role in the evolution process because of their effect on linking preference. A majority of topological properties are content-related, and among them the clustering features are sensitive to the contents of Web documents. In this paper, we first observe the impact of content similarity on web links by introducing a metric called Linkage Probability. Then we investigate how contents influence the formation mechanism of the most basic cluster, the triangle, with a metric named Triangularization Probability. Experimental results indicate that content similarity has a positive effect on the process of cluster formation in the Web. Theoretical analysis predicts the influence of contents on the clustering features in the Web very well.
  • Keywords
    Complex networks; Computer science; Couplings; Information retrieval; Joining processes; Web pages; Web sites
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Web Intelligence, IEEE/WIC/ACM International Conference on
  • Conference_Location
    Fremont, CA
  • Print_ISBN
    978-0-7695-3026-0
  • Type

    conf

  • DOI
    10.1109/WI.2007.93
  • Filename
    4427069