• DocumentCode
    3211310
  • Title

    A stochastic approach for modeling and computing Web communities

  • Author

    Greco, Gianluigi ; Greco, Sergio ; Zumpano, Ester

  • Author_Institution
    DEIS, Univ. della Calabria, Italy
  • fYear
    2002
  • fDate
    12-14 Dec. 2002
  • Firstpage
    43
  • Lastpage
    52
  • Abstract
    In the last few years, a lot of research has been devoted to developing new techniques for improving the recall and precision of current Web search engines. Few works deal with the interesting problem of identifying the communities to which pages belong. Most previous approaches tried to cluster data by means of spectral techniques or traditional hierarchical algorithms. The main problem with these techniques is that they ignore the fact that Web communities are social networks with distinctive statistical properties. We analyze Web communities on the basis of the evolution of an initial set of hubs and authoritative pages. The evolution law captures the behaviour of page authors with respect to the popularity of existing pages for topics of interest. Assuming such a model, we have found interesting properties of Web communities and have proposed a technique for computing relevant properties for specific topics. Several experiments have confirmed the validity of both the model and the identification method.
  • Keywords
    Internet; information needs; information resources; information retrieval; stochastic processes; Web community computing; Web community modeling; Web search engines; authoritative pages; hubs; identification method; page authors; precision; recall; social networks; statistical properties; stochastic approach; Bibliometrics; Clustering algorithms; Information retrieval; Organizing; Portals; Search engines; Social network services; Stochastic processes; Web pages; Web search;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Information Systems Engineering, 2002. WISE 2002. Proceedings of the Third International Conference on
  • Print_ISBN
    0-7695-1766-8
  • Type

    conf

  • DOI
    10.1109/WISE.2002.1181642
  • Filename
    1181642