• DocumentCode
    2933131
  • Title

    Exploiting Routing Information Encoded into Backlinks to Improve Topical Crawling

  • Author

    Mouton, Alban ; Marteau, Pierre-Francois

  • Author_Institution
    Valeria Eur. Univ. of Brittany, Vannes, France
  • fYear
    2009
  • fDate
    4-7 Dec. 2009
  • Firstpage
    659
  • Lastpage
    664
  • Abstract
    Local link analysis of topical graphs on the Web allows to experiment focused crawling strategies in a detailed way. In this scope, models, parameters and metrics used to orientate the crawler can be better understood, tuned and evaluated. We develop a methodological and experimental approach that exploits link analysis in order to determine what constitutes a good content analysis metric able to guide efficiently topical crawlers toward highly relevant areas of the Web. Our experimentations show that partial knowledge of the local topology of topical graph highlights our understanding of routing capabilities of various metrics. Furthermore, our experimentations demonstrate that significant crawling efficiency improvement can be reached.
  • Keywords
    Internet; graph theory; telecommunication network routing; telecommunication network topology; Web; backlinks; content analysis metric; link analysis; local link analysis; local topology; routing capabilities; routing information; topical crawling; topical graph; Computer applications; Constraint optimization; Containers; Design optimization; Integer linear programming; Laboratories; Pattern recognition; Printing; Routing; Testing; Topical crawling; Web topology; backlinks;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Soft Computing and Pattern Recognition, 2009. SOCPAR '09. International Conference of
  • Conference_Location
    Malacca
  • Print_ISBN
    978-1-4244-5330-6
  • Electronic_ISBN
    978-0-7695-3879-2
  • Type

    conf

  • DOI
    10.1109/SoCPaR.2009.129
  • Filename
    5370352